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start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: June 6, 2006, 20:17:19 ; Search time 4150 Seconds 

(without alignments) 
8583.281 Million cell updates/sec 

Title: US-09-869-098A-l_COPY_1634_2270 
Perfect score: 637 

Sequence: 1 acctgttgctgcagttctga cgcccggctcgtccgacgcg 637 



Scoring table: I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



48236798 seqs, 27959665780 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



96473596 



Post-processing: 



Minimum Match 
Maximum Match 
Listing first 



0% 

100% 

45 summaries 



Database : EST:* 



1 




gb_estl : * 


2 




gb_est3:* 


3 




gb_est4 :* 


4 




gb_est5 :* 


5 




gb_est6:* 


6 




gb_htc : * 


7 




gb_est2 :* 


8 




gb_est7 :* 


9 




gb_est8:* 


10: 


gb_est9: * 


11: 


gb_gssl: * 


12: 


gb_gss2:* 


13: 


gb_gss3: * 


14: 


gb_gss4 : * 



Pred. No. is the number of results predicted by chance to have a 
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score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
DB045257 

LOCUS DB045257 558 bp mRNA linear EST ll-NOV-2005 
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DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



DB045257 TESTI2 Homo sapiens cDNA clone TESTI2033446 5', mRNA 

sequence. 

DB045257 

DB045257.1 GI:82062586 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 558) 

Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita, R. , Yamamoto, J. , Sekine,M. , Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T. , Takahashi-Fuj ii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 ' 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 1 -end one 
pass sequencing: RAB. 

Location/Qualifiers 

1. .558 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="TESTI203344 6" 
/tissue_type=" testis" 
/ c 1 o ne_l ib- " T ES T 1 2 " 
/note="Vector : pME18SFL3" 



ORIGIN 



Query Match 71.9%; 
Best Local Similarity 99.8%; 
Matches 469; Conservative 



Score 458; DB 9; Length 558; 
Pred. No. 3.1e-95; 
0 ; Mi sma t c he s 0 ; I nd e 1 s 



1 ; Gaps 



1; 



Qy 



Db 



168 AAAAGAGGAAGT GC AC T TAAGAC ACGGC CCC GC TGGACGC T TGT TAGAAACC GTC C TGGC 227 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 AAAAGAGGAAGTGC AC T TAAGAC AC GGC C C C GC TGGACGC - TGT T AGAAACCGT C C TGGC 59 



Qy 

Db 

Qy 

Db 



228 TGGGAAGGCAAGAGG TGTGTGAC T GGAC AAGAC TTGTTTCT GGC GGT C AGTC T TGCCATC 287 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
60 TGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATC 119 

288 C TC AC AGAGGT TGGC GGC CC GAGAGAGT GTGAGGC AGAGGCGGGGAGT GGC AAGGGAGTG 347 

M I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
120 C TC AC AG AGGT TGGC GGC CC GAGAGAGT GTGAGGC AGAGGCGGGGAGT GGC AAGGGAGTG 179 
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Qy 348 ACCATC T CGGGGAAC GAAGGAGTAAACGCGGT GATGGGAC GCACGGAAACGGGAGT GGAG 407 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M | I I I 
Db 180 ACCATC T C GGGGAAC GAAGGAGTAAACGCGGT GATGGGAC GC AC GGAAACGGGAGT GGAG 239 

Qy 408 AAAGTCATGGAGAGAACCC TAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGC TCCAGGGT 467 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 240 AAAGTC AT GGAGAGAAC C C TAGGC GGGGCGGT CC CC GC GGAAAGGC GGC T GC TC C AGGGT 299 

Qy 468 CTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCC 527 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 300 CTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCC 359 

Qy 528 CCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCT 587 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 360 CCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCT 419 

Qy 588 GCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 420 GCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 4 69 



RESULT 2 
DB047138 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



FEATURES 



DB047138 5 60 bp mRNA linear EST ll-NOV-2005 

DB047138 TESTI2 Homo sapiens cDNA clone TESTI2035855 5 f , mRNA 
sequence . 
DB047138 

DB047138.1 GI: 821304 61 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 560) 

Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita, R. , Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T. , Takahashi-Fujii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. andSugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ! -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3* -end one 
pass sequencing: RAB. 

Location/Qualifiers 
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source 1. .560 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone="TESTI2035855" 
/tissue_type=" testis" 
/clone_lib="TESTI2" 
/note="Vector: pME18SFL3" 

ORIGIN 

Query Match 68.9%; Score 439; DB 9; Length 560; 

Best Local Similarity 99.6%; Pred. No. 7.6e-91; 

Matches 450; Conservative 0; Mismatches 1; Indels 1; Gaps 1; 

Qy 186 AAGACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGT 245 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I ! I II I I I I I I I I 
Db 1 AAGACACGGCCCCGC TGGACGC -TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGT 59 

Qy 246 GTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGC 305 

I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 60 GTGAC TGGAC AAGAC T TGT T T C TGNC GG TC AGTC T T GC CAT C C T CAC AGAGGT TGGC GGC 119 

Qy 306 CC GAGAGAGTGTGAGGCAGAGGC GGGGAGTGGC AAGGGAG TGAC CAT C TCGGGGAAC GAA 365 

I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 120 CCGAGAGAGTG TGAGGC AGAGGC GGGGAGTGGC AAGGGAG TGAC CATC T CGGGGAACGAA 179 

Qy 366 GGAGTAAAC GC GGTGATGGGACGC ACGGAAAC GGGAGT GGAGAAAGTC ATGGAGAGAACC 425 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 
Db 180 GGAGTAAAC GC GGTGATGGGACGC AC GGAAAC GGGAGT GGAGAAAGT C ATGGAGAGAACC 239 

Qy 426 CTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGA 485 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 240 CTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGA 299 

Qy 486 GCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAG 545 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 300 GCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAG 359 

Qy 546 CCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTG 605 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 360 CCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTG 419 

Qy 606 ACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 
Db 420 ACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 451 



RESULT 3 
DA890833 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 



DA890833 559 bp mRNA linear EST 10-NOV-2005 

DA890833 RECTM2 Homo sapiens cDNA clone RECTM2000758 5', mRNA 
sequence . 
DA890833 

DA890833.1 GI: 81452209 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 (bases 1 to 559) 
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AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita, R. , Yamamoto, J. , Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama, T . , Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T. , Takahashi-Fuj ii , A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano f S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3' -end one 
pass sequencing: RAB. 

Location/Qualifiers 

1. .559 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/clone="RECTM2000758" 
/tissue_type=" rectum" 
/clone_lib="RECTM2 " 
/note="Vector : pME18SFL3" 



ORIGIN 



Query Match 63.8%; 
Best Local Similarity 99.5%; 
Matches 418; Conservative 



Score 40 6.4; DB 9; 
Pred. No. 2.6e-83; 
0; Mismatches 1; 



Length 559; 



Indels 



1 ; Gaps 



1; 



Qy 


213 


Db 


1 


Qy 


273 


Db 


61 


Qy 


332 


Db 


121 


Qy 


392 


Db 


181 


Qy 


452 


Db 


241 


Qy 


512 



AGAAAC C GT C C T GGC TGGGAAGGC AAGAGGTG TGTGACTGGAC AAGAC T T GT T T C T GGCG 272 
I II I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGAAAC C GT C C T GGC TGGGAAGGC AAGAGGT G TGTGAC T GGAC AAGAC T T GT T TC T GGCG 60 

GTC AGTC T T GC CATC C TC AC AGAGGT TGGC - GGC CC GAGAGAGT GTGAGGC AGAGGCGGG 331 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTC AGTC T TGC CATC C TC AC AGAGGT TGGCAGGC CC GAGAGAGT GTGAGGC AGAGGCGGG 120 

GAG TGGC AAGGGAGT GAC CAT C TC GGGGAACGAAGGAGTAAAC GC GGT GATGGGAC GC AC 391 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

GAG TGGC AAGGGAGT GACC AT C TC GGGGAACGAAGGAGTAAAC GCGGT GAT GGGAC GC AC 180 

GGAAACGGGAG TGGAGAAAGTCAT GGAGAGAAC C C T AGGC GGGGC GGT CCC C GCGGAAAG 451 

I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
GGAAAC GGGAG TGGAGAAAGTCAT GGAGAGAAC C C T AGGC GGGGCGGT CCC C GC GGAAAG 240 

GCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCA 511 
I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
GCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCA 300 

GGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCC 571 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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suuKt, searcn Kesuits uetaiis tor Application uyaoyuys ana searcn Kesuit us-uy-st>y-u... fage / or iy 



Db 

Qy 
Db 



301 GGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCC 360 



572 



631 



ACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
361 ACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTTC 420 



RESULT 4 
DB045802 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



ORIGIN 



DB045802 314 bp mRNA linear EST ll-NOV-2005 

DB045802 TESTI2 Homo sapiens cDNA clone TESTI2034168 5', mRNA 
sequence. 
DB045802 

DB045802.1 GI: 82069361 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 314) 

Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita,R., Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H. , Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T . , Takahashi-Fuj ii , A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ! -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 '-end one 
pass sequencing: RAB. 

Location/Qualifiers 

1. .314 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref="taxon:9606" 
/clone="TESTI2034168" 
/tissue_type=" testis" 
/clone_lib="TESTI2" 
/note="Vector : pME18SFL3" 



Query Match 47.3%; Score 301; DB 9; Length 314; 

Best Local Similarity 99.4%; Pred. No. 5.9e-59; 

Matches 312; Conservative 0; Mismatches 1; Indels 1; Gaps 



1; 



Qy 



213 AGAAACC GTCCTGGC TGGGAAGGCAAGAGGTGTGTGAC TGGACAAGAC TTGT TTCTGGCG 272 
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sluku searcn Kesuits Details tor Application uysoyuvs ana searcn Kesuit us-uy-8t>y-u... fage s ot iy 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 1 AGAAACCGTCCTGGC TGGGAAGGCAAGAGGTGTGTGACTGGACAAGAC TTGT TTCTGGCG 60 

Qy 273 GT C AGT C T TGCC ATC C TC AC AGAGGT TGGCGGCC C GAGAGAGTG TGAGGC AGAGGC GGGG 332 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GTC AGTC T TGC CAT CCTC AC AGAGGT TGGCG^GCCCGAGAGAGTG TGAGGC AGAGGC GGGG 120 

Qy 333 AGT GGCAAGGGAGTGAC CAT C TC GGGGAAC GAAGGAGTAAACGC GGTG ATGGGACGCACG 392 

I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 121 AGT GGCAAGGGAGTGAC CATC TCGGGGAAC GAAGGAGTAAAC GCGGTGATGGGACGC AC G 180 

Qy 393 GAAACG - GGAGT GGAGAAAGTC AT GGAGAGAACC C T AGGC GGGGC GGTC CC C GC GGAAAG 451 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 181 GAAAC GT GGAGT GGAGAAAGTC AT GGAGAGAACC C T AGGC GGGGC GGT C C C C GC GGAAAG 240 

Qy 452 GCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCA 511 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 
Db 241 GCGGCTGCTCCAGGGTCTNCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCA 300 

Qy 512 GGCCCCACCCCGGG 525 

I I I I I I I I I I I I I I 
Db 301 GGCCCCACCCCGGG 314 



RESULT 5 
BI222607 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI222607 427 bp mRNA linear EST ll-JUL-2001 

602940387F1 NIH_MGC_12 Homo sapiens cDNA clone IMAGE: 5103573 5*, 
mRNA sequence. 
BI222607 

BI222607 .1 GI : 14 676051 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 

Hominidae ; Homo . 

1 (bases 1 to 427) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: Incyte Genomics, Inc. 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM11249 row: n column: 22 
High quality sequence stop: 348. 
Location/Qualifiers 
1. .427 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xr e f = " taxon : 9 60 6 " 
/ c lone= " IMAGE :5103573" 

/tissue_type="cervical carcinoma cell line" 

/lab_host="DH10B" 

/clone lib="NIH MGC 12" 
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suukii searcn Results Details tor Application uysbyuys ana search Kesuit us-uy-»t>y-u ... I'age y or iy 



/note="Organ: cervix; Vector: pCMV-SPORT6; Site_l: NotI; 
Site_2: Sail; Cloned unidirectionally . Primer: Oligo dT. 
Average insert size 1.4 kb. Library prepared by Life 
Technologies . " 

ORIGIN 

Query Match 46.5%; Score 296.4; DB 2; Length 427; 

Best Local Similarity 93.0%; Pred. No. 6*8e-58; 

Matches 333; Conservative 0; Mismatches 21; Indels 4; Gaps 2; 



Qy 152 GGGGGGCTGGGCCCATAAAAGAGGAAGTGCAC TTAAGACACGGCCCCGCTGGACGC TTGT 211 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I III 
Db 1 GGGGGGC TGGGCCCATAAAAGAGGAAGTGCAC TTAAGACACGGCCCCGC TGTGACGC TGT 60 

Qy 212 TAGAAAC CGTC C TGGC TGGGAAGGCAAGAGGT GTGT GAC T GGAC AAGAC TTGTTTCT GGC 271 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 AAGAAAC CGTC C TGGC T GGGAAGGCAAGAGGT GTGT GAC T GGAC AAGAC TTGTTTCT GGC 120 

Qy 272 GGT C AGT C T TGC CAT C C TC AC AGAGGT T GGC GGC CC GAGAGAGTGTGAGGC AGAGGCGGG 331 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 121 GGT C AGT CT TGC CAT C C TC AC AGAGGT T GGCGGC C C GAGAGAGTGT GAGGC AGAGGC GGG 180 

Qy 332 GAGTGGC AAGGGAGT GACCATCTC GGGGAACGAAGGAGTAAACGCGGT GATGGGAC GCAC 391 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 GAGTGGC AAGGGAGT GACCATCTC GGGGAACGAAGGAGTAAACGCGGT GATGGGAC GCAC 240 

Qy 392 GGAAAC GGGAGT GGAGAAAGTC ATGGAGAGAACCC T AGGC GGGGC GGT C CC C G — CGGAA 449 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 
Db 241 GGAAAC GGGAGT GGAGAAAGTC AT GGAGAGAACCC T AGGC GGGGCGGT C CC CGACGGAAA 300 

Qy 450 AGGCGGCTGCTCCA — GGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGC 505 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 301 GTGCGGCTGCTCCATGGGTCTCCTGCACCCAAGTAGGAGCTGGCAGGCCACGTGCCAC 358 



RESULT 6 
DA359011 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



DA359011 534 bp mRNA linear EST 06-NOV-2005 

DA359011 BRSTN2 Homo sapiens cDNA clone BRSTN2004725 5 ? , mRNA 
sequence . 
DA359011 

DA359011.1 GI: 80897832 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo . 
1 (bases 1 to 534) 

Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S. , Ishibashi, T . , Takahashi-Fujii, A. , 
Tanase,T., Naga^K., Kikuchi^., Nakai,K., Isogai f T. and Sugano f S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 



http://es/ScoreAccessWeb/GetItem.action?AppId=09869098&seqId=532385&ItemNa^ 6/7/2006 
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FEATURES 

source 



FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3' -end one 
pass sequencing: RAB. 

Location/Qualifiers 

1. .534 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone="BRSTN2004725" 
/tissue_type=" subthalamic nucleus" 
/clone_lib= " BRSTN2 " 
/note="Vector : pME18SFL3" 



ORIGIN 



Query Match 16.9%; 
Best Local Similarity 98.3%; 
Matches 119; Conservative 



Score 107.4; DB 9; 
Pred. No. 3.3e-14; 
0; Mismatches 1; 



Length 534; 



Indels 



1; Gaps 



1; 



Db 



518 ACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCG 577 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCG 60 



Qy 

Db 

Qy 

Db 



578 AAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGC - TCGCCCGGC TCGTCCGACGC 636 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 AAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTTCGCCTGGC TCGTCCGACGC 120 

637 G 637 
I 

121 G 121 



RESULT 7 
BP305626 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 



BP305626 630 bp mRNA linear EST 17-SEP-2004 

BP305 626 Sugano cDNA library, macrophage Homo sapiens cDNA clone 
MPG05177, mRNA sequence. 
BP305626 

BP305626.1 GI: 52234586 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo . 
1 (bases 1 to 630) 

Suzuki, Y. , Yamashita, R. , Shirota,M., Sakakibara, Y. , Chiba, J., 
Mizushima-Sugano, J. , Nakai,K. and Sugano, S. 

Sequence comparison of human and mouse genes reveals a homologous 
block structure in the promoter regions 
Genome Res. 14 (9), 1711-1718 (2004) 
15342556 

Contact: Yutaka Suzuki 
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FEATURES 

source 



ORIGIN 



Department of Virology 

Institute of Medical Science, University of Tokyo 
4-6-1, Shirokanedai, Minatoku, Tokyo 108-8639, Japan 
Tel: 81-3-5449-5343 
Fax: 81-3-5449-5416 
Email : ysuzuki@hgc . jp . 

Location/Qualifiers 

1. .630 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="MPG05177" 
/cell_type= "macrophage" 

/clone_lib="Sugano cDNA library, macrophage" 



Query Match 15.9%; Score 101; DB 3; Length 630; 

Best Local Similarity 100.0%; Pred. No. le-12; 

Matches 101; Conservative 0; Mismatches 0; Indels 0; 



Qy 



Db 



Gaps 



0; 



537 AGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCC 596 
M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 AGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCC 60 



Qy 



Db 



597 TTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 TTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 101 



RESULT 8 
DB194182 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



DB194182 346 bp mRNA linear EST 03-DEC-2005 

DB194182 TOVAR2 Homo sapiens cDNA clone TOVAR2002676 5', mRNA 
sequence . 
DB194182 

DB194182 .1 GI: 83152118 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 (bases 1 to 346) 

Kimura,K., Wakamatsu, A. , Suzuki, Y., Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M. , Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma,M. , 
Murakawa, K. , Ishida, S. , Ishibashi, T . , Takahashi-Fujii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 
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SLUKfc, searcn Results uetaiis tor Application uy»t>yuys ana searcn Kesuit us-uy-st>y ... fage izoi iy 



NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .346 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="TOVAR2002 676" 

/tissue_type="ovary, tumor tissue" 

/ c 1 o ne_l ib= " TOVAR 2 " 

/note="Vector : pME18SFL3" 

ORIGIN 

Query Match 14.9%; Score 95; DB 9; Length 346; 

Best Local Similarity 100.0%; Pred. No. 2.4e-ll; 

Matches 95; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 543 AAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGA 602 

M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1 AAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGA 60 

Qy 603 TTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 61 TTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 95 



RESULT 9 
DA903915 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



DA903915 566 bp mRNA linear EST 13-NOV-2005 

DA903915 SKNMC2 Homo sapiens cDNA clone SKNMC2004170 5', mRNA 
sequence . 
DA903915 

DA903915. 1 GI: 82 3597 45 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 566) 

Kimura,K., Wakamatsu, A. , Suzuki, Y. , Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M. , Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma,M. , 
Murakawa, K. , Ishida , S . , Ishibashi , T . , Takahashi-Fuj ii , A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. andSugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 



http://es/ScoreAccessWeb/Gefltem.action?AppId=09869098&seqId=532385&Item 6/7/2006 



suuKfc, searcn Kesuits uetaiis tor Application uysoyuys ana searcn Kesuit us-uy-soy... Fage i j ot iy 



Email : f 1 j -cdna@nif ty . com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 1 -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .566 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9 606" 

/clone="SKNMC2004170" 

/cell_type="neuroepithelioma" 

/cell_line="SK-N-MC" 

/clone_lib="SKNMC2" 

/note="Vector : pME18SFL3" 

ORIGIN 

Query Match 14.9%; Score 95; DB 9; Length 566; 

Best Local Similarity 100.0%; Pred. No. 2.4e-ll; 

Matches 95; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 543 AAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGA 602 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 AAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGA 60 

Qy 603 TTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I II I I 1 
Db 61 TTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 95 



RESULT 10 

DA821600 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



DA821600 574 bp mRNA linear EST ll-NOV-2005 

DA821600 PERIC2 Homo sapiens cDNA clone PERIC2001607 5 T , mRNA 
sequence . 
DA821600 

DA821600. 1 GI: 821092 69 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 574) 

Kimura,K., Wakamatsu, A. , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T . , Takahashi-Fuj ii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. andSugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
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Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI) ; 5'-end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualif iers 

source 1. .574 

/organism="Homo sapiens" 

/mol_type= "mRNA" 

/db_xref="taxon:9606" 

/clone=" PERIC2001607" 

/tissue_type="pericardium" 

/clone_lib=" PERIC2 " 

/note="Vector : pME18SFL3" 

ORIGIN 

Query Match 14.8%; Score 94; DB 9; Length 574; 

Best Local Similarity 100.0%; Pred. No. 4.1e-ll; 

Matches 94; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 544 AGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGAT 603 - 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 1 AGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGAT 60 

Qy 604 TGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 61 TGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 94 



RESULT 11 

CR991281 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
COMMENT 



CR991281 326 bp mRNA linear EST 28-JUN-2005 

CR991281 RZPD no. 9017 Homo sapiens cDNA clone R2PDp9017D208 5', 
mRNA sequence. 
CR991281 

CR991281.1 GI: 68285166 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 326) 

Heil,0., Ebert,L., Hennig,S., Henze,S., Radelof,U., Schneider, D. 
and Korn, B. 

Human T-Lymphocytes library 
Unpublished (2005) 
Contact: Inge Arlart 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14059 Berlin, Germany 
Email: www.rzpd.de 
RZPD; RZPDp9017D208. 

RZPDLIB; (Human T-Lymphocytes) RZPD LIB No. 9017 

http: //www. rzpd.de/cgi-bin/products/set . cgi?libNo=9017 Contact: 
Inge Arlart 

RZPD Deutsches Ressourcenzentrum fuer Genomf orschung GmbH 
Heubnerweg 6, D-14059 Berlin, Germany 
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Tel: +49 30 32639 100 
Fax: +49 30 32639 111 
www . r zpd . de 

This clone is available from RZPD; 

http://www.rzpd.de/cgi- bin/products /cl . cgi?CloneID=RZPDp9017D208 
contact RZPD (product- support@rzpd.de) for further information. 
Primer name: qe3_4 , Primer sequence: CGGATAACAATTCACACAG. 
FEATURES Location/Qualifiers 
source 1. .326 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone="RZPDp9017D208 " 

/tissue_type="T-Lymphocytes" 

/ dev_s tage= " adult " 

/lab_host="DH10B" 

/clone_lib="RZPD no. 9017" 

/note="Vector : pQE80LSN_cloned; Site_l: Sail; Site_2 : 
NotI; vector: 

http : //www. r zpd . de/inf o/vectors/pQE80LSN_cloned_jpic . shtml 
; 1st strand cDNA was prepared from mRNA obtained from 
human T-Lymphocytes with a NotI - oligo(dT) primer [5 1 
GACTAGTTCTAGATCGCGAGCGGCCGCCCTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Sail adaptors, 
digested with NotI and cloned into the NotI and Sail sites 
of the pQE80LSN_cloned vector" 

ORIGIN 

Query Match 12.2%; Score 78; DB 8; Length 326; 

Best Local Similarity 100.0%; Pred. No. 2e-07; 

Matches 78; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 560 GCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCG 619 

I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCG 60 

Qy 620 CCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I 
Db 61 CCCGGCTCGTCCGACGCG 78 



RESULT 12 

BM552598 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BM552598 1012 bp mRNA linear EST 20-FEB-2002 

AGENCOURT_654 3723 NIH_MGC_71 Homo sapiens cDNA clone IMAGE : 5549778 
5 1 , mRNA sequence . 
BM552598 

BM552598.1 GI : 18790596 
EST. 

Homo sapiens (human) 
Homo, sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 

Hominidae; Homo . 

1 (bases 1 to 1012) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 
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cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM12260 row: n column: 19 
High quality sequence stop: 524. 
FEATURES Location/Qualifiers 
source 1. .1012 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone=" IMAGE : 554 9778 " 

/ tis sue_type= " leiomyo s ar c oma " 

/lab_host="DH10B (phage-resistant ) " 

/clone_lib="NIH_MGC_71" 

/note="Organ: uterus; Vector: pCMV-SPORT6; Site_l: NotI; 
Site_2: Sail; Cloned unidirectionally . Primer: Oligo dT. 
Average insert size 2.1 kb. " 

ORIGIN 

Query Match 12.1%; Score 77; DB 2; Length 1012; 

Best Local Similarity 100.0%; Pred. No. 3.6e-07; 

Matches 77; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 561 CGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGC 620 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGC 60 

Qy 621 CCGGCTCGTCCGACGCG 637 

I I I I I I I I I I 1 I I I I I I 
Db 61 CCGGCTCGTCCGACGCG 77 



RESULT 13 

DA947305 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
COMMENT 



DA947305 512 bp mRNA linear EST 08-NOV-2005 

DA947305 SPLEN2 Homo sapiens cDNA clone SPLEN2023327 5', mRNA 
sequence. 
DA947305 

DA947305.1 GI: 81200519 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 (bases 1 to 512) 

Kimura,K., Wakamatsu, A. , Suzuki, Y., Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irle,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida,S., Ishibashi, T . , Takahashi-Fujii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
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FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : f 1 j -cdnaQnif ty . com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI); 5 1 -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .512 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon:9606" 

/clone="SPLEN2023327" 

/tissue_type=" spleen" 

/clone_lib="SPLEN2" 

/note="Vector : pME18SFL3" 

ORIGIN 

Query Match 11.7%; Score 74.4; DB 9; Length 512; 

Best Local Similarity 98.7%; Pred. No. 1.4e-06; 

Matches 75; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 562 GCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCC 621 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
Db 4 GCGAAGC C C CAC T GC GAAGC CC AGC T GC GC GC GCC T T GGGAT T GAC TG TCC ACGC T CGC C 63 

Qy 622 CGGCTCGTCCGACGCG 637 

I I I I I f I I I I I I I I I I 
Db 64 CGGCTCGTCCGACGCG 7 9 



RESULT 14 

DB194024 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 



DB194024 512 bp mRNA linear EST 03-DEC-2005 

DB194024 TOVAR2 Homo sapiens cDNA clone TOVAR2002496 5', mRNA 
sequence . 
DB194024 

DB194 024.1 GI: 83151096 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 512) 

Kimura,K., Wakamatsu, A. , Suzuki, Y. , Ota,T., Nishikawa, T . , 
Yamashita,R. , Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T . , Takahashi-Fuj ii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. andSugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 
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COMMENT Contact: Takao Isogai 

FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 

NED0 human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan) ; cDNA library construction: 
Helix Research Institute (HRI); 5' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .512 

/organism="Homo sapiens" 

/mol_t ype= "mRNA" 

/db_xref ="taxon: 9606" 

/clone="TOVAR20024 96" 

/tissue_type="ovary, tumor tissue" 

/clone_lib="T0VAR2" 

/note="Vector : pME18SFL3" 

ORIGIN 

Query Match 11.7%; Score 74.4; DB 9; Length 512; 

Best Local Similarity 98.7%; Pred. No. 1.4e-06; 

Matches 7 5; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 562 GCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCC 621 

Ml I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 4 GCGAAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCC 63 

Qy 622 CGGCTCGTCCGACGCG 637 

II I i I I I I I I I I I I I I 
Db 64 CGGCTCGTCCGACGCG 79 



RESULT 15 

DA650717 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 



DA650717 551 bp mRNA linear EST 07-NOV-2005 

DA650717 MESAN2 Homo sapiens cDNA clone MESAN2004345 5', mRNA 
sequence . 
DA650717 

DA650717.1 GI: 8113708 9 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 551) 

Kimura,K., Wakamatsu, A. , Suzuki, Y., Ota,T., Nishikawa, T. , 
Yamashita,R., Yamamoto,J., Sekine,M., Tsuritani, K. , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M. , 
Murakawa, K. , Ishida, S . , Ishibashi, T . , Takahashi-Fujii, A. , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai, T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
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PUBMED 
COMMENT 



FEATURES 

source 



16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Tel: 81-438-52-3975 
Fax: 81-438-52-3986 
Email : flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 

Location/Qualifiers 

1. .551 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="MESAN2004 345" 

/cell_type="normal mesangial cells (NHMC56046-2 ) " 
/clone_lib="MESAN2" 

/note="Vector : pME18SFL3; primary culture, normal 
mesangial cells" 



ORIGIN 



Query Match 11.7%; 
Best Local Similarity 98.7%; 
Matches 75; Conservative 



Score 74.4; DB 9; 
Pred. No. 1.4e-06; 
0; Mismatches 1; 



Length 551; 



Indels 



0 ; Gaps 



0; 



Qy 



Db 



562 GCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCC 621 
III I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
4 GC GAAGC C CC AC T GC GAAGCCC AGC TGC GCGC GC C T T GGGAT TGAC T G T CC ACGC T CGC C 63 



Qy 



Db 



622 CGGCTCGTCCGACGCG 637 
I I I I I I I I I I I I I I I I 
64 CGGCTCGTCCGACGCG 79 



Search completed: June 6, 2006, 22:34:03 
Job time : 4154 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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SCORE Search Results Details for Application 
09869098 and Search Result us-09-869-098a- 
l__copy_1634_2270.rnpbn- 

Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09869098 and Search Result us-09- 

869-098a-l_copy_1634_2270.rnpbn. 

start 

Go Back to previous paq 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



June 6, 2006, 22:23:01 ; Search time 25 Seconds 

(without alignments) 
3000.881 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



US-09-869-098A-l_COPY_1634_2270 
637 

1 acctgttgctgcagttctga cgcccggctcgtccgacgcg 637 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



246837 seqs, 58886990 residues 



Total number of hits satisfying chosen parameters: 



493674 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing : 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Publ i s he d_App 1 i c a t ion s_NA_Ne w : * 



/EMC_Celerra_SIDS3/ptodata/2/pubpna/US09_NEW_PUB.seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/US06_NEW_PUB.seq:* 

/ EMC_Cele r r a_S IDS 3 /ptoda ta /2 /pubpna /US 0 7_NEW_PUB .seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_NEW_PUB.seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/PCT_NEW_PUB.seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/USll_NEW_PUB.seq:* 

/EMC_Celerra_SIDS3/ptodata/2/pubpna/US60_NEW_PUB.seq:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 
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No. Score Match Length DB ID Description 





1 


41 


.8 


6. 


6 


1214 


6 


US- 


10- 


953- 


349 


-30760 


Sequence 


30760, A 


c 


2 


41 


.8 


6. 


6 


4563 


6 


US- 


10- 


196- 


749 


-311 


Sequence 


311, App 




3 




41 


6. 


4 


3219 


7 


US- 


11- 


293- 


697 


-1221 


Sequence 


1221, Ap 


c 


4 


40 


.8 


6. 


4 


2050 


6 


US- 


10- 


953- 


349 


-31154 


Sequence 


31154, A 


c 


5 


40 


.4 


6. 


3 


2684 


6 


US- 


10- 


199- 


229 


-7 


Sequence 


7, Appli 




6 


39 


.8 


6. 


2 


1400 


6 


US- 


10- 


953- 


349 


-28773 


Sequence 


28773, A 


c 


7 


38 


.8 


6. 


1 


42999 


7 


US- 


11- 


284- 


877 


-17 


Sequence 


17, Appl 




8 




38 


6. 


0 


1339 


6 


US- 


10- 


505- 


928 


-53 


Sequence 


53, Appl 




9 




38 


6. 


0 


2715 


7 


US- 


11- 


293- 


697 


-1028 


Sequence 


1028, Ap 


c 


10 


37 


.8 


5. 


9 


461 


6 


US- 


10- 


953- 


349 


-30799 


Sequence 


30799, A 




11 


37 


.6 


5. 


9 


706 


6 


US- 


10- 


953- 


349 


-31527 


Sequence 


31527, A 




12 


37 


.4 


5. 


9 


461 


6 


US- 


10- 


953- 


349 


-30799 


Sequence 


30799, A 




13 


37 


.4 


5. 


9 


1073 


6 


US- 


10- 


953- 


349 


-37736 


Sequence 


37736, A 




14 


37 


.4 


5. 


9 


1354 


7 


US- 


11- 


293- 


697 


-208 


Sequence 


208, App 




15 


37 


.4 


5. 


9 


5703 


7 


us- 


11- 


312- 


958 


-63 


Sequence 


63, Appl 




16 


37 


.2 


5. 


8 


955 


6 


us- 


10- 


953- 


349 


-33699 


Sequence 


33699, A 




17 




37 


5. 


8 


1024 


6 


us- 


10- 


953- 


349 


-33192 


Sequence 


33192, A 




18 


36 


.6 


5. 


7 


1114 


6 


us- 


10- 


953- 


349 


-32119 


Sequence 


32119, A 


c 


19 


36 


.4 


5. 


7 


1569 


6 


us- 


10- 


953- 


349 


-33558 


Sequence 


33558, A 


c 


20 


36 


.4 


5. 


7 


1958 


6 


us- 


10- 


953- 


349 


-38532 


Sequence 


38532, A 




21 




36 


5. 


7 


2213 


6 


us- 


10- 


953- 


349 


-27640 


Sequence 


27640, A 


c 


22 




36 


5. 


7 


3020 


7 


us- 


11- 


293- 


697 


-645 


Sequence 


645, App 




23 




36 


5. 


7 


38187 


6 


us- 


10- 


857- 


260 


-31 


Sequence 


31, Appl 




24 


35 


.8 


5. 


6 


2995 


7 


us- 


11- 


315- 


777 


-7 


Sequence 


7, Appli 


c 


25 


35 


.6 


5. 


6 


706 


6 


us- 


10- 


953- 


349 


-31527 


Sequence 


31527, A 




26 


35 


.4 


5. 


6 


1540 


6 


us- 


10- 


953- 


349 


-29034 


Sequence 


29034, A 




27 


35 


.2 


5. 


5 


1587 


7 


us- 


11- 


293- 


697 


-136 


Sequence 


136, App 




28 


35 


.2 


5. 


5 


1822 


6 


us- 


10- 


505- 


928 


-16 


Sequence 


16, Appl 




29 


35 


.2 


5. 


5 


2134 


6 


us- 


10- 


196- 


749 


-369 


Sequence 


369, App 


c 


30 


35 


.2 


5. 


5 


2267 


7 


us- 


11- 


293- 


697 


-2354 


Sequence 


2354, Ap 


c 


31 


35 


.2 


5. 


5 


4763 


6 


us- 


10- 


505- 


928 


-374 


Sequence 


374, App 




32 




35 


5. 


5 


1122 


6 


us- 


10- 


953- 


349 


-31251 


Sequence 


31251, A 




33 




35 


5. 


5 


1357 


6 


us- 


10- 


953- 


349 


-34605 


Sequence 


34605, A 




34 




35 


5. 


5 


1667 


6 


us- 


10- 


505- 


928 


-419 


Sequence 


419, App 




35 


34 


.6 


5. 


4 


907 


6 


us- 


10- 


953- 


349 


-15391 


Sequence 


15391, A 




36 


34 


.6 


5. 


4 


1533 


6 


us- 


10- 


953- 


349 


-31890 


Sequence 


31890, A 


c 


37 


34 


.6 


5. 


4 


1727 


7 


us- 


11- 


293- 


697 


-1686 


Sequence 


1686, Ap 




38 


34 


. 6 


5. 


4 


1841 


6 


us- 


10- 


953- 


349 


-34354 


Sequence 


34354, A 


c 


39 


34 


.4 


5. 


4 


626 


6 


us- 


10- 


953- 


349 


-39430 


Sequence 


39430, A 




40 


34 


.4 


5. 


4 


825 


6 


us- 


10- 


953- 


349 


-39057 


Sequence 


39057, A 




41 


34 


.4 


5. 


4 


1170 


6 


us- 


10- 


■953- 


349 


-32139 


Sequence 


32139, A 


c 


42 


34 


.4 


5. 


4 


1337 


6 


us- 


10- 


953- 


349 


-35245 


Sequence 


35245, A 




43 


34 


.4 


5. 


4 


1374 


6 


us- 


10- 


953- 


349 


-31642 


Sequence 


31642, A 


c 


44 


34 


.4 


5. 


4 


1466 


6 


us- 


10- 


■505- 


928 


-458 


Sequence 


458, App 


c 


45 


34 


.4 


5. 


4 


2151 


7 


us- 


11- 


-245- 


628 


-34 


Sequence 


34, Appl 



ALIGNMENTS 



RESULT 1 

US-10-953-349-30760 

; Sequence 30760, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al. 

; TITLE OF INVENTION: SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 
; TITLE OF INVENTION: ENCONDED THERBY 
; FILE REFERENCE: 2750-1579PUS2 
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; CURRENT APPLICATION NUMBER : US/10/953,34 9 
; CURRENT FILING DATE: 2004-09-30 
; NUMBER OF SEQ ID NOS : 40252 

SOFTWARE: Patentln version 3.3 
; SEQ ID NO 30760 

LENGTH: 1214 

TYPE: DNA 
; ORGANISM: Triticum aestivum 
US-10-953-349-30760 

Query Match 6.6%; Score 41.8; DB 6; Length 1214; 

Best Local Similarity 56.0%; Pred. No. 0.15; 

Matches 7 9; Conservative 0; Mismatches 62; Indels 0; Gaps 0; 

Qy 456 CTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCC 515 

I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I 

Db 321 CGGGTTCCGGTTCCGGCCCACCGACGAGGAGCTGCTGCTGCACTACCTGCGCCGCAAGGC 380 

Qy 516 CCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTG 575 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 381 CCTCTCCTGCCCCCTCCCCGCCGACATCATCCCCGTCGCCGACCTCGCGCGCCTCCACCC 440 

Qy 57 6 CGAAGCCCAGCTGCGCGCGCC 596 

I I I I I I I I I I 
Db 441 ATCGGAC C T GC C AGGC GAGGC 461 



RESULT 2 

US-10-196-74 9-311/C 

Sequence 311, Application US/10196749 
Publication No. US20060094864A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT : Chen, Jian 
APPLICANT: Desnoyers , Luc 
APPLICANT : Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Gurney, Austin L. 
AP PLIC ANT : Pan , Jame s 
APPLICANT : Smith, Victoria 
APPLICANT: Watanabe, Colin K. 
APPLICANT: Wood, William I. 
APPLICANT : Zhang, Zemin 

TITLE OF INVENTION: SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC 
TITLE OF INVENTION: ACIDS ENCODING THE SAME 
FILE REFERENCE: P3430R1C340 
CURRENT APPLICATION NUMBER: US/10/196,749 
CURRENT FILING DATE: 2002-07-16 
PRIOR APPLICATION NUMBER: 10/052586 
PRIOR FILING DATE: 2002-01-15 
PRIOR APPLICATION NUMBER: 60/059263 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/059266 
PRIOR FILING DATE: 1997-09-18 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/063120 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063121 
PRIOR FILING DATE: 1997-10-24 
PRIOR APPLICATION NUMBER: 60/063486 
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PRIOR FILING DATE: 1997-10-21 
PRIOR APPLICATION NUMBER: 60/063540 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063541 
PRIOR FILING DATE: 1997-10-28 
PRIOR APPLICATION NUMBER: 60/063544 
PRIOR FILING DATE: 1997-10-28 

Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 612 
SEQ ID NO 311 
LENGTH: 4563 
TYPE: DNA 

ORGANISM: Homo Sapien 
FEATURE : 

NAME /KEY : unsure 
LOCATION: 3635 

OTHER INFORMATION: unknown base 
US-10-196-749-311 

Query Match 6.6%; Score 41.8; DB 6; Length 4563; 

Best Local Similarity 55.0%; Pred. No. 0.2; 

Matches 82; Conservative 0; Mismatches 67; Indels 0; Gaps 0; 

Qy 424 CCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAG 483 

\\ III I I I I I I I I I I I I I I I II III I 

Db 4 61 CCGCCGCCGCCGCCGCCGCCGCCGCAGGCTCCGCGCGCCCTCAGCCCGCTGCCCCCGCCC 402 

Qy 484 GAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTA 543 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 401 ACGGAGGCCAGCCCGCCCTAGCGCCGCGAGGCCCGGCCCGGGCAGCGCCGCCGCCGCCGC 342 

Qy 544 AGCCGCGCCGCCGCCTGCGCGGAGCCCCA 572 

I I I I I I I I II I I I I I Mil 
Db 341 GGGCCCGCCGCGCCCATCGCGGCCCCCGA 313 



RESULT 3 

US-11-293-697-1221 

; Sequence 1221, Application US/11293697 
; Publication No. US20060105376A1 
; GENERAL INFORMATION: 

; APPLICANT: HELIX RESEARCH INSTITUTE 

; TITLE OF INVENTION: Novel full length cDNA 

; FILE REFERENCE: H1-A0106 

; CURRENT APPLICATION NUMBER: US/11/293,697 

; CURRENT FILING DATE: 2005-12-05 

; PRIOR APPLICATION NUMBER: US/10/108,260 

; PRIOR FILING DATE: 2002-03-28 

; NUMBER OF SEQ ID NOS: 5458 

; SOFTWARE: Patentln Ver . 2.1 

; SEQ ID NO 1221 

LENGTH: 3219 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-11-293-697-1221 

Query Match 6.4%; Score 41; DB 7; Length 3219; 

Best Local Similarity 52.0%; Pred. No. 0.28; 

Matches 92; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 
Qy 413 CATGGAGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCG 472 
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I I I I I II 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 I 1 1 1 I II III 

Db 168 C AGGC AGGAGAC TGGAGACGAGGT C T TGC AGGCGGAAC TC AGGAT GC T C TGAGC TGC C C G 227 

Qy 473 CACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCC 532 

III I I I I I I I I I I I I I I I I I I I I III I I I I I I 

Db 228 CACAACCCCTGGACCTTCACCCCTCGCCCCTTCCCCGCATCCAGCTGCCCCAGCCCCTGC 287 

Qy 533 CCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGC 589 

III I I I I III I I I I I I I I I I I I I I I I I 

Db 288 CCAGGCTGCGTAGCCTAGCGGGGGTCTGCGGTCCTAGCCCCTCCCCGCGCCACCTAC 344 



RESULT 4 

US-10-953-349-31154/C 

; Sequence 31154, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION : 

; APPLICANT: ALEXANDROV, Nickolai et al . 

; TITLE OF INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 

; TITLE OF INVENTION: ENCONDED THERBY 

; FILE REFERENCE: 2750-1579PUS2 

; CURRENT APPLICATION NUMBER: US/10/953,34 9 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS : 40252 

; SOFTWARE: Patentln version 3.3 

; SEQ ID NO 31154 

LENGTH: 2050 

TYPE: DNA 
; ORGANISM: Triticum aestivum 
US-10-953-349-31154 

Query Match 6.4%; Score 40.8; DB 6; Length 2050; 

Best Local Similarity 50.5%; Pred. No. 0.28; 

Matches 99; Conservative 0; Mismatches 97; Indels 0; Gaps 0; 

Qy 435 GCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGG 494 

I I I I I I I I I I I I I I I I I I I I I I II II I I I I 

Db 1586 GC GGCAC C AGC C AGAACGC C GAC ATGCC GGCGC C GC CGC C GCCGGC CAGC TCC AGC C TGC 1527 

Qy 495 CCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGC 554 

III I I I I I I I II I I I I I I I I I I I I I I I III 

Db 1526 GCCGCGCGCCCTCCACCGCGCCGGACACGGCCATCGCCGCGGTGGCCAGCAGCAGGCCCA 1467 

Qy 555 CGC CTGCGCGGAGCCCCACTGCGAAGCC CAGC TGCGCGCGCCTTGGGATTGACTGTCCAC 614 

I I I I I I I I II II III I I I I I I I II I I I II I 

Db 1466 CGCCGACGCGCTGCTTCATGGTGAGCCCCCGCGGGTCGCCCGTCACCCGTCGCAGCGCCG 1407 

Qy 615 GCTCGCCCGGCTCGTC 630 

I I I I I I III 
Db 1406 GCGCGATCCACCTGTC 1391 



RESULT 5 

US-10-199-229-7/C 

; Sequence 7, Application US/10199229 
; Publication No. US20060099701A1 
; GENERAL INFORMATION: 
; APPLICANT: She, Jin-Xiong 

APPLICANT: Kumar, Pradeep 
; APPLICANT: Wang, Cong-Yi 

; TITLE OF INVENTION: ACTIVATORS OF CYCLIN-DEPENDENT KINASES 
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; FILE REFERENCE: 5853-210 

; CURRENT APPLICATION NUMBER : US/10/199,229 

; CURRENT FILING DATE: 2002-07-19 

; NUMBER OF SEQ ID NOS : 16 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 7 

LENGTH: 2 684 

TYPE: DNA 
; ORGANISM: Mus musculus 
US-10-199-229-7 

Query Match 6.3%; Score 40.4; DB 6; Length 2684; 

Best Local Similarity 54.8%; Pred. No. 0.37; 



Matches 80; Conservative 0; Mismatches 66; Indels 0; Gaps 0; 

Qy 456 CTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCC 515 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 458 CGGCTCCACGCGCACCGCCAGCAGCGCGGAGCGCGGGTGGCCGGCCTCGGCGCGCAGGCG 399 

Qy 516 CCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTG 575 

I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 398 CAGCAGCGCGCGCCACTCGCCCGTCGGGGCGGCCGCCTCCTCGGGCTCCGTGGCCCGTCC 339 

Qy 57 6 CGAAGCCCAGCTGCGCGCGCCTTGGG 601 

II I I I I I I I I I I I I 
Db 338 GCCCTCCGGGCAGCCTGCGCCCTCGG 313 



RESULT 6 

US-10-953-349-28773 

; Sequence 28773, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al. 

TITLE OF INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 
; TITLE OF INVENTION: ENCONDED THERBY 
; FILE REFERENCE: 2750-157 9PUS2 
; CURRENT APPLICATION NUMBER: US/10/953,34 9 
; CURRENT FILING DATE: 2004-09-30 
; NUMBER OF SEQ ID NOS: 40252 

SOFTWARE: Patentln version 3.3 
; SEQ ID NO 28773 
LENGTH: 1400 
TYPE: DNA 

ORGANISM: Triticum aestivum 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (900).. (900) 

OTHER INFORMATION: n is a, c, g, or t 
US-10-953-349-28773 

Query Match 6.2%; Score 39.8; DB 6; Length 1400; 

Best Local Similarity 56.5%; Pred. No. 0.44; 

Matches 74; Conservative 0; Mismatches 57; Indels 0; Gaps 0; 

Qy 465 GGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGG 524 

I I M I III I I I I I I I Ml I I I I I M I I I I I I I I 
Db 50 GATGTCGACGGCGACGCGGATGCTGGCGCGCCGCGCCGCCCCCGCCCTCCTCCGCTCCGC 109 

Qy 525 GCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCA 584 

I I I I I I I I II II I I I I I I I II I I I I III I II I 
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Db 110 CGCCAGCCCCGGCCTCCTCCGCTCCCCCGCCGCCGGCACAGGCCCCAGCCCCGCCCTCGC 169 



Qy 585 GCTGCGCGCGC 595 

I I I I I I I I I 
Db 170 GCTGCCGGCGC 180 



RESULT 7 

US-11-284-877-17/C 

; Sequence 17, Application US/11284877 
; Publication No. US20060095984A1 
GENERAL INFORMATION : 

APPLICANT: Hadlaczky, Gyula 
; Szalay, Aladar 

TITLE OF INVENTION: ARTIFICIAL CHROMOSOMES, USES THEREOF AND METHODS 

FOR PREPARING ARTIFICIAL CHROMOSOMES 
NUMBER OF SEQUENCES: 34 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson 

STREET: 12390 El Camino Real 
; CITY: San Diego 

STATE: CA 

COUNTRY: USA 

ZIP : 92130 
COMPUTER READABLE FORM: 

MEDIUM TYPE: CD-ROM 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/11/284,87 7 

FILING DATE: 21-Nov-2005 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 10/808,689 

FILING DATE: 24-MAR-2004 

APPLICATION NUMBER: 10/219,694 

FILING DATE: 14-AUG-2002 

APPLICATION NUMBER: 10/151,081 

FILING DATE: 16-MAY-2002 

APPLICATION NUMBER: 10/151,078 

FILING DATE: 16-MAY-2002 

APPLICATION NUMBER: 10/125,767 

FILING DATE: 17-APR-2002 

APPLICATION NUMBER: 10/287,313 

FILING DATE: 01-NOV-2002 

APPLICATION NUMBER: 09/799,462 

FILING DATE: 05-MAR-2001 

APPLICATION NUMBER: 09/724,872 

FILING DATE: 28-NOV-2000 

APPLICATION NUMBER: 09/724,726 

FILING DATE: 28-NOV-2000 

APPLICATION NUMBER: 09/724,693 

FILING DATE: 28-NOV-2000 

APPLICATION NUMBER: 08/835,682 

FILING DATE: 10-APR-1997 

APPLICATION NUMBER: 08/695,191 

FILING DATE: 07-AUG-1996 

APPLICATION NUMBER: 08/682,080 

FILING DATE: 15-JUL-1996 

APPLICATION NUMBER: 08/629,822 
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FILING DATE: 10-APR-1996 
ATTORNEY / AGENT INFORMATION: 

NAME: Seidman, Stephanie L 
REGISTRATION NUMBER: 33,779 

REFERENCE/DOCKET NUMBER: 17084-004018/402Q 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 858-678-4777 
TELEFAX: 202-626-7796 
TELEX : 

INFORMATION FOR SEQ ID NO: 17: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 42999 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 
FRAGMENT TYPE: 
ORIGINAL SOURCE: 

SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
US-11-284-877-17 

Query Match 6.1%; Score 38.8; DB 7; Length 42999; 

Best Local Similarity 48.6%; Pred. No. 1.5; 

Matches 106; Conservative 0; Mismatches 112; Indels 0; Gaps 0; 



Qy 387 CGC ACGGAAAC GGGAGTGGAGAAAGTCATGGAGAGAACCC TAGGCGGGGCGGTCCC CGCG 446 

I I I I I I I I I I III III II III I I I I I I I I 

Db 6431 C CC ACGC GGGGGC GAGC GCGGAC AC CAC C C C AC AGGC GCC C GGGGGT T CCC GCCC C CAC G 6372 

Qy 447 GAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCC 506 

I III till I I I I I I I I I I I I I I I I I I I 

Db 6371 GCGCGGGGCGCACGCCACACGCGCGGCAGGCGCGCGACGGCCGCCGGGTAAAGCCCCCAC 6312 

Qy 507 CCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGA 566 

III I I I I I I III III I I I I I I I I I I III I I I I I I 

Db 6311 CCGACGGCCGCCGCGGCGGCGGCGGCGGCGCGGCCCCGGCCGGGGAGCGGAGTCCGCGGT 6252 

Qy 567 GCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATT 604 

I I I I I I I III! I I II I 

Db 6251 GGAGGCGCGGGAGGGGC C GGGC CC C T CC CGAC GGGAC T 6214 



RESULT 8 

US-10-505-928-53 

; Sequence 53, Application US/10505928 
; Publication No. US20060088532A1 
; GENERAL INFORMATION: 

; APPLICANT: Ludwig Institute for Cancer Research et al. 

; TITLE OF INVENTION: LYMPHATIC ENDOTHELIAL GENES 

; FILE REFERENCE: 28967/39178 

; CURRENT APPLICATION NUMBER: US/10/505,928 

; CURRENT FILING DATE: 2004-08-27 

; PRIOR APPLICATION NUMBER: US 60/363,019 

; PRIOR FILING DATE: 2002-03-07 

; NUMBER OF SEQ ID NOS : 866 

; SOFTWARE: Patentln 3.2 

; SEQ ID NO 53 

LENGTH: 1339 

TYPE: DNA 
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ORGANISM: Homo sapiens 
FEATURE : 

OTHER INFORMATION : Human Sox-18 
US-10-505-928-53 

Query Match 6.0%; Score 38; DB 6; Length 1339; 

Best Local Similarity 53.3%; Pred. No. 1.1; 

Matches 80; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 

Qy 486 GCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAG 545 

I I M I I I I I I I I I I I I I I III III I I I I I I I I I II 

Db 233 GCTGACACGCGCGGCCTCGCCGCCGGCCCCGCCGCCCTCGCCGCGCCCGCCGCGCCCGCC 292 

Qy 54 6 CCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTG 605 

III I I I I I I I I I I I I II I I I I I I III II II I 

Db 293 TCGCCGCCCAGCCCGCAGCGCAGTCCCCCGCGCAGCCCCGAGCCGGGGCGCTATGGCCTC 352 

Qy 606 ACTGTCCACGCTCGCCCGGCTCGTCCGACG 635 

I II III II I I I I I I 

Db 353 AGC C C GGCC GGC C GC GGGGAAC GC C AGGC G 382 



RESULT 9 

US-11-293-697-1028 

; Sequence 1028 r Application US/11293697 
; Publication No. US20060105376A1 
; GENERAL INFORMATION: 

; APPLICANT: HELIX RESEARCH INSTITUTE 

; TITLE OF INVENTION: Novel full length cDNA 

; FILE REFERENCE: H1-A0106 

; CURRENT APPLICATION NUMBER: US/11/2 93,697 

; CURRENT FILING DATE: 2005-12-05 

; PRIOR APPLICATION NUMBER: US/10/108,260 

; PRIOR FILING DATE: 2002-03-28 

; NUMBER OF SEQ ID NOS : 5458 

; SOFTWARE: Patentln Ver . 2.1 

; SEQ ID NO 1028 

LENGTH: 2715 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-11-293-697-1028 

Query Match 6.0%; Score 38; DB 7; Length 2715; 

Best Local Similarity 53.3%; Pred. No. 1.3; 

Matches 80; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 

Qy 487 CTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGC 546 

Ml I I I I I I III II II I I I I I I I I I I M 

Db 1676 CCGGGCGCCCCGGAGCCGAAGCCCGAGCCGAGGAGGTGGCGCTCGCAGCCGCGTCTCAGC 1735 

Qy 547 CGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGA 606 

I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I III I 
Db 1736 CCCGTCGCCGCCGGCGCCAAGACCCAGGGAGAGGCGGCGCAGCCCGGGCGCTCGGGCTCC 1795 

Qy 607 CTGTCCACGCTCGCCCGGCTCGTCCGACGC 636 

I II I I I I I I I I I I I I 

Db 1796 GCGCTGCCGGCGCCCCGCCACCCCCGCCGC 1825 



RESULT 10 

US-10-953-349-30799/C 
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; Sequence 30799, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al. 

; TITLE OF INVENTION: SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 

; TITLE OF INVENTION: ENCONDED THERBY 

; FILE REFERENCE: 2750-157 9PUS2 

; CURRENT APPLICATION NUMBER: US/10/953,349 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS : 40252 

; SOFTWARE: Patentln version 3.3 

; SEQ ID NO 30799 

LENGTH: 4 61 

TYPE: DNA 
; ORGANISM: Triticum aestivum 
US-10-953-349-30799 

Query Match 5.9%; Score 37.8; DB 6; Length 4 61; 

Best Local Similarity 51.5%; Pred. No. 1; 

Matches 87; Conservative 0; Mismatches 82; Indels 0; Gaps 0; 

Qy 451 GGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGC 510 

I I I I I I I I I I 1 I I I I I I I I I I I II I I I II II I 

Db 192 GGCGGCGGCGCCGACTCGTCCTCGCTGGAGGAGGAGTCGGTGGCGTCGGCGTCGTGCACG 133 

Qy 511 AGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCC 570 

II I I I I I II III I II I I I I I I I I I I I I I I II 

Db 132 ATGATCCGCACCACGCGGCCCCTGGGCTGCGCCTCCGCCGCAGCCGCCGCCGCCGCCGCC 73 

Qy 571 C AC TGC GAAGC CC AGC TGC GCGCGC C T T GGGATT GAC T GT C C AC GC TC G 619 

II II I I I I I I I II I I I I I I I I I I I 

Db 72 TCGGCCGCCGACTCCATCCCCGCGGCGGCGGCCAGCATGTCCAGGCTAG 24 



RESULT 11 

US-10-953-349-31527 

; Sequence 31527, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al. 

; TITLE OF INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 

; TITLE OF INVENTION: ENCONDED THERBY 

; FILE REFERENCE: 2750-157 9PUS2 

; CURRENT APPLICATION NUMBER: US/10/953,349 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS: 40252 

SOFTWARE: Patentln version 3.3 
; SEQ ID NO 31527 

LENGTH: 706 

TYPE: DNA 
; ORGANISM: Triticum aestivum 
US-10-953-349-31527 

Query Match 5.9%; Score 37.6; DB 6; Length 706; 

Best Local Similarity 45.8%; Pred. No. 1.2; 

Matches 130; Conservative 0; Mismatches 154; Indels 0; Gaps 0; 

Qy 288 C TC ACAGAGGT TGGC GGC C CGAGAGAGT GTGAGGC AGAGGC GGGGAGT GGCAAGGGAGTG 347 

I I I I I I I I I III II Ml I I I I I I I II 

Db 121 CTCCGAGCCGTGTGAGAGACGATGAAGAAGGGGGGTCTGCTGCGGTGCGTGTCGACGGGG 180 
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Qy 348 ACCATCTCGGGGAACGAAGGAGTAAACGCGGTGATGGGACGCACGGAAACGGGAGTGGAG 407 

I I II I III I I II I I I I I III I I I I III 

Db 181 GCGTGCAGGGTGGCGCCGGGGGCCGTGGCGGAGATGGCCATCTCGCCGACGGCGTCGGGG 240 

Qy 408 AAAGTC AT GGAGAGAAC C C T AGGC GGGGCGGT C C C C GCGGAAAGGCGGC TGC TC C AGGGT 467 

I I I I I I I I II I I I I I I I I I I I I II I I III 

Db 241 AAGGTGC CGGC GGGGCACGTGCCGGTGGAGGTGGGCGCGGAGGGGGAGGAGACGGAGCGC 300 

Qy 468 CTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCC 527 

I I I I III III I I I I I I I I I I I I I I I I 

Db 301 TTCCTCGTGCCGGCCGAGCTGCTGGGCCGCCCTCCCATCGCCGAGCTGCTCCGCCGGGCC 360 

Qy 528 CCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCC 571 

III I I I I I I I I I I I I I I I I III 

Db 361 GCGCAGGAGT AC GGC T AC GC GCGACGC GGC C C GC TC C GC ATC CC 404 



RESULT 12 

US-10-953-349-30799 

; Sequence 30799, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al . 

; TITLE OF INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 

; TITLE OF INVENTION: ENCONDED THERBY 

; FILE REFERENCE: 2750-1579PUS2 

; CURRENT APPLICATION NUMBER: US/10/953,34 9 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS : 40252 

SOFTWARE: Patentln version 3.3 
; SEQ ID NO 30799 

LENGTH: 4 61 

TYPE : DNA 
; ORGANISM: Triticum aestivum 
US-10-953-349-30799 

Query Match 5.9%; Score 37.4; DB 6; Length 461; 

Best Local Similarity 53.0%; Pred. No. 1.2; 

Matches 80; Conservative 0; Mismatches 71; Inciels 0; Gaps 0; 

Qy 451 GGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGC 510 

I I I I I I II I I I I I I I I I I I III I II III 

Db 85 GGCGGCTGCGGCGGAGGCGCAGCCCAGGGGCCGCGTGGTGCGGATCATCGTGCACGACGC 144 

Qy 511 AGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCC 570 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 

Db 145 CGACGCCACCGACTCCTCCTCCAGCGAGGACGAGTCGGCGCCGCCGCCGCCGCTGGCCAC 204 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGG 601 

I I I I I I I I I I I I I I II 

Db 205 GAT GC CGACGACCAAGGAGGAGC GCCGGAGG 235 



RESULT 13 

US-10-953-349-37736 

; Sequence 37736, Application US/10953349 
; Publication No. US20060107345A1 
; GENERAL INFORMATION: 

; APPLICANT: ALEXANDROV, Nickolai et al. 

; TITLE OF INVENTION: SEQUENCE- DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDE 
; TITLE OF INVENTION: ENCONDED THERBY 
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; FILE REFERENCE: 2750-1579PUS2 

; CURRENT APPLICATION NUMBER: US/10/953,349 

; CURRENT FILING DATE: 2004-09-30 

; NUMBER OF SEQ ID NOS : 40252 

; SOFTWARE: Patentln version 3.3 

; SEQ ID NO 37736 

LENGTH: 1073 

TYPE: DNA 

ORGANISM: Zea mays subsp. mays 
US-10-953-349-37736 

Query Match 5.9%; Score 37.4; DB 6; Length 1073; 

Best Local Similarity 48.7%; Pred. No. 1.5; 

Matches 131; Conservative 0; Mismatches 136; Indels 2; Gaps 1; 



Qy 327 GC GGGGAGTGGC AAGGGAGTGACC AT C T CGGGGAAC GAAGGAGT AAAC GCGGTGAT GGGA 386 

II I I I I I I I III I I I I I I I I II II 

Db 469 GCGGGAGGCGGCGGCTCCCTGAGGCAGTGGTGGCGGCGGGGCGTCGTCGGCTGGCTCCTG 528 

Qy 387 CGC ACGGAAAC GGGAGT GGAGAAAGTCATGGAGAGAAC C C T AGGCGGGGC GG TC CC CGCG 446 

I II I I I I III I I I I I I I II I I I I I I I I I I 

Db 529 TC C C GGAAGCC GGGG T TCGC GGGC GACC T CGAGATGAAC GAGGAC GAGGCCC GCC AC GGC 588 

Qy 447 GAAAGGC GGC TGC TC CAGGGTCTC CGCACCCAAGTAGGAGCTGGCAGGC — CCGGCCCCG 504 

IN! Ill I I I I I II III I I I I I I I I I I I I II 
Db 589 CAGGGGCACCTGGGGCTGGGGCCACGTCCTCTACAGGGTGCGCTCCGGCGTCCGCCGCCT 648 

Qy 505 CCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCG 564 

I II II III I I I I I I II I II I I I I I I I I I I I I I 

Db 649 CGTCGTGTCCGCGCACTCGCTGCCCACCACGCAGAGGCAGTCCCTGCCGCCGCCGGCGCA 708 

Qy 565 GAGCCCCACTGCGAAGCCCAGCTGCGCGC 593 

II I I I I III I I I I I I 

Db 709 CTGCAAGCCGGCGGCGCTCGCCCACGCGC 737 



RESULT 14 
US-11-293-697-208 

; Sequence 208, Application US/11293697 
; Publication No. US20060105376A1 
; GENERAL INFORMATION: 

; APPLICANT: HELIX RESEARCH INSTITUTE 

; TITLE OF INVENTION: Novel full length cDNA 

FILE REFERENCE: H1-A0106 
; CURRENT APPLICATION NUMBER: US/11/2 93,697 
; CURRENT FILING DATE: 2005-12-05 
; PRIOR APPLICATION NUMBER: US/10/108,260 
; PRIOR FILING DATE: 2002-03-28 
; NUMBER OF SEQ ID NOS: 5458 
; SOFTWARE: Patentln Ver . 2.1 
; SEQ ID NO 208 . 

LENGTH: 1354 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-11-293-697-208 

Query Match 5.9%; Score 37.4; DB 7; Length 1354; 

Best Local Similarity 53.0%; Pred. No. 1.5; 

Matches 80; Conservative 0; Mismatches 71; Indels 0; Gaps 0; 
Qy 451 GGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGC 510 
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Db 14 GTCGGCGGCTCCGGGCGCCTCCAGCTCCGGCTCGGGCTCCGGCTCCGGCTCCGGGCGCCG 73 

Qy 511 AGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCC 570 

I III I I I II I I I I I I I I I I I II I I I I I I I II I I 

Db 74 ACCAGGTCCCCTTGGCCCCCGCCGCGCGCCCTCCJUVCCTGCTGCTGCCGCCGCCGCGCCC 133 

Qy 571 C AC TGCGAAGC CC AGC TGC GC GC GC C T T GGG 601 

III I I I I I I I I I I I I III 

Db 134 CCCAGGCTCTGCCAGCTACCCGCGCTCTGAG 164 



RESULT 15 
US-11-312-958-63 

Sequence 63, Application US/11312958 
Publication No. US20060100152A1 
GENERAL INFORMATION: 
APPLICANT: Millennium Pharmaceuticals, Inc. 
APPLICANT: Rosenfeld, Julie Beth 
APPLICANT : Silos-Santiago, Inmaculada 

TITLE OF INVENTION: METHODS AND COMPOSITIONS IN TREATING 

TITLE OF INVENTION: PAIN AND PAINFUL DISORDERS USING 9949, 14230, 760, 62553, 
TITLE OF INVENTION: 12216, 17719, 41897, 47174, 33408, 10002, 16209, 314, 636, 
TITLE OF INVENTION: 27410, 33260, 619, 15985, 69112, 2158, 224, 615, 44373, 
TITLE OF INVENTION: 95431, 22245, 2387, 16658, 55054, 16314, 1613, 1675, 9569 OR 
TITLE OF INVENTION: 13424 MOLECULES 
FILE REFERENCE: MPI02-027P1RNOMNIM 
CURRENT APPLICATION NUMBER: US/11/312,958 
CURRENT FILING DATE: 2005-12-20 
PRIOR APPLICATION NUMBER: US/10/369,022 
PRIOR FILING DATE: 2003-02-19 
PRIOR APPLICATION NUMBER: US 60/360,495 
PRIOR FILING DATE: 2002-02-28 
PRIOR APPLICATION NUMBER: US 60/370,121 
PRIOR FILING DATE: 2002-04-04 
PRIOR APPLICATION NUMBER: US 60/373,010 
PRIOR FILING DATE: 2002-04-16 
PRIOR APPLICATION NUMBER: US 60/373,908 
PRIOR FILING DATE: 2002-04-19 
PRIOR APPLICATION NUMBER: US 60/377,717 
PRIOR FILING DATE: 2002-05-03 
PRIOR APPLICATION NUMBER: US 60/379,949 
PRIOR FILING DATE: 2002-05-13 
PRIOR APPLICATION NUMBER: US 60/382,409 
PRIOR FILING DATE: 2002-05-21 
PRIOR APPLICATION NUMBER: US 60/385,280 
PRIOR FILING DATE: 2002-06-03 
PRIOR APPLICATION NUMBER: US 60/386,879 
PRIOR FILING DATE: 2002-06-06 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 64 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 63 
LENGTH: 5703 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (213) . . . (2402) 
US-11-312-958-63 
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Query Match 5.9%; Score 37.4; DB 7; Length 5703; 

Best Local Similarity 61.6%; Pred. No. 2.1; 

Matches 77; Conservative 0; Mismatches 46; Indels 2; Gaps 1; 

Qy 442 CCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCC 501 

II II I II I I I I I I I I II I I I I I I I II I I I I I I I I I I 
Db 7 CCCTGCACTAGTGGCCGCAACCGAGACGCCGCGCTCCAGCAGCTGCTGCC — GCCCAGCC 64 

Qy 502 CCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGC 561 

I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I 

Db 65 CGGCCCCGCCGCCGCCCCCCAGCCCTGCAGCCCCGCAGCCCCGGCCGCGCCCAGCCCGGC 124 

Qy 562 GCGGA 566 

I I II 

Db 125 GAGGA 129 



Search completed: June 6, 2006, 22:25:21 
Job time : 2 6 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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SCORE Search Results Details for Applicatior 
09869098 and Search Result us-09-869-098< 
l_copy__1634__2270.rnpbm. 



Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments / Sugg 

This page gives you Search Results detail for the Application 09869098 and Search Result us-09-8( 
l_copy_1634_2270.rnpbm. 

start 

Go Back to pn 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: June 6, 2006, 21:25:09 ; Search time 1201 Seconds 

(without alignments) 
6517.255 Million cell updates/sec 

Title: US-09-869-098A-l_COPY_1634_2270 
Perfect score: 637 

Sequence: 1 acctgttgctgcagttctga cgcccggctcgtccgacgcg 637 

Scoring table: I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 18892170 seqs, 6143817638 residues 

Total number of hits satisfying chosen parameters: 37784340 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Published_Applic 

1: /EMC_Celerra 

2 : /EMC_Celerra 

3: /EMC_Celerra 

4 : /EMC_Celerra 

5 : /EMC_Celerra 

6: /EMC_Celerra 

7 : /EMC_Celerra 

8 : /EMC_Celerra 

9: /EMC_Celerra 

10: /EMC_Celerra 

11: /EMC_Celerra 

12: /EMC_Celerra 

13: /EMC_Celerra 

14: /EMC_Celerra 

15: /EMC_Celerra 

16: /EMC Celerra 



ations_NA_Main: * 

SIDS3/ptodata/2/pubpna/US07_PUBCOMB.seq:* 
SIDS3/ptodata/2/pubpna/US08_PUBCOMB.seq:* 
SIDS3/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 
SIDS3/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 
SIDS3/ptodata/2/pubpna/US09C_PUBCOMB. seq:* 
SIDS3/ptodata/2/pubpna/US10A_PUBCOMB. seq:* 
SIDS3/ptodata/2/pubpna/US10B_PUBCOMB. seq:* 
SIDS3 /ptodata /2 /pubpna/US 1 0C_PUBCOMB . seq : * 
SIDS3/ptodata/2/pubpna/US10D_PUBCOMB. seq:* 
_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB.seq: 
_SIDS3/ptodata/2/pubpna/US10F_PUBCOMB.seq: 
_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB.seq: 
_SIDS3/ptodata/2/pubpna/USllA_PUBCOMB.seq: 
SIDS3/ptodata/2/pubpna/USllB__PUBCOMB.seq: 
_SIDS3/ptodata/2/pubpna/USllC_PUBCOMB.seq: 
_SIDS3/ptodata/2/pubpna/USllD_PUBCOMB . seq: 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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7 


.6 


148083 


10 


US- 10-7 5 6-14 9-4571 


Sequence 4571, Ap 




23 


47 . 4 


7 


.4 


671 


6 


US- 10-184- 644-34 6 


Sequence 34 6, App 




O /I 

Z 4 


47.4 


7 


.4 


671 


6 


US- 10-184- 634-34 6 


Sequence 34 6, App 




o c 
Z 0 


4 / . 4 


7 


.4 


171096 


9 


tto in o o^n o 
US-lU-blZ-oby-o 


Sequence 8, Appli 


c 


0 c 
Z D 


/IT O 

4 / . Z 


7 


.4 


863 


8 


TTO 1 A '7C^ Tfil CO A o 

Ub— 1U— tot— /Ul— boUo 


Sequence 6303 , Ap 


c 


Z / 


/IT O 

4 / . Z 


7 


.4 


1534 


8 


T TO IA A Ol QCO TTTn 4 

US-lU-4^ / - y bo- / / / /4 


Sequence 777 74, A 




o o 
Z o 


4 / 


7 


.4 


1347 


8 


TTO IA TCT T A1 IIOAI 

US- ID- / b/ - / Ul-llz03 


Sequence 11203, A 


c 


O Q 

z y 


4 / 


7 


.4 


3725 


3 


tto no oio nin /r 

us— uy-oi /-yiu-b 


Sequence 6, Appli 


c 


O u 


^1 "7 
H I 


7 


.4 


3725 


7 


TTO in 1 Q A CAQ CO 

Ub— 1U— Xo4— b4o— bZ 


Sequence 62, Appl 


c 


oJ. 


A 1 
4 / 


7 


.4 


3725 


9 


tto ia tto coc oo 
Ub— 1U— / /Z — bob— Jo 


Sequence 33 , Appl 


c 


j z 


A 1 

4 / 


7 


.4 


3725 


16 


TTO 11 OQO A QO OO 

Ub— 11— Zoo— 4yj— OO 


Sequence 33, Appl 


c 


O O 


A C A 


7 


.3 


1447 


8 


tto i A /I A. C O QAOQ 

ub— ±u— 4o /— ybo— yuzy 


Sequence 9029, Ap 


Q 


34 


45.8 


7 


.2 


408 


10 


TT^I — 1 0— 4 SO — 7CO — 1 R9P.7 

(JO J.W i JU /DO ±*J£.0 1 


?ommnr*a 1 Q O Q *7 TV 

ocquencc iozo / , ±\ 




35 


45.6 


7 


.2 


594 


6 


US-10-123-155-10 


Sequence 10, Appl 




36 


45.6 


7 


.2 


594 


7 


US-10-146-731-10 


Sequence 10, Appl 




37 


45.6 


7 


.2 


594 


7 


US-10-140-472-10 


Sequence 10, Appl 




38 


45.6 


7 


.2 


594 


7 


US-10-141-761-10 


Sequence 10, Appl 




39 


45.6 


7 


.2 


594 


7 


US-10-142-885-10 


Sequence 10, Appl 




40 


45.6 


7 


.2 


594 


7 


US-10-158-790-10 


Sequence 10, Appl 




41 


45.6 


7 


.2 


594 


7 


US-10-137-871-10 


Sequence 10, Appl 




42 


45.6 


7 


.2 


594 


7 


US-10-140-923-10 


Sequence 10, Appl 




43 


45.6 


7 


.2 


594 


7 


US-10-141-756-10 


Sequence 10, Appl 




44 


45.6 


7 


.2 


594 


7 


US-10-141-759-10 


Sequence 10, Appl 




45 


45.6 


7 


.2 


594 


7 


US-10-140-805-10 


Sequence 10, Appl 



ALIGNMENTS 

RESULT 1 
US-10-197-019-1 
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Sequence 1, Application US/10197019 
Publication No. US20030207284A1 
GENERAL INFORMATION: 

APPLICANT: Chew, Anne 

APPLICANT: Denton, R. Rex 

APPLICANT: Gilson, Christopher Raleigh 

APPLICANT: Nandabalan, Krishnan 

APPLICANT: Parks, Katie E. 

TITLE OF INVENTION: HAPLOTYPES OF THE UCP2 GENE 
FILE REFERENCE: MWH-0042US 

CURRENT APPLICATION NUMBER: US/10/197,019 
CURRENT FILING DATE: 2002-07-16 
PRIOR APPLICATION NUMBER: PCT/US01/0248 5 
PRIOR FILING DATE: 2001-01-25 
NUMBER OF SEQ ID NOS : 116 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 1 

LENGTH: 9314 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY: allele 
LOCATION: (1283) . . (1283) 

OTHER INFORMATION: PS1: polymorphic base cytosine or guanine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (1714) . . (1714) 

OTHER INFORMATION: PS 2 : polymorphic base cytosine or thymine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (2051) . . (2051) 

OTHER INFORMATION: PS3: polymorphic base thymine or cytosine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (2124) . . (2124) 

OTHER INFORMATION: PS 4 : polymorphic base cytosine or thymine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (2287) . . (2287) 

OTHER INFORMATION: PS 5 : polymorphic base cytosine or guanine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (2408) . . (2408) 

OTHER INFORMATION: PS 6: polymorphic base adenine or guanine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (4768) . . (4768) 

OTHER INFORMATION: PS7: polymorphic base adenine or guanine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (4785) . . (4785) 

OTHER INFORMATION: PS 8 : polymorphic base guanine or adenine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (4813) . . (4813) 

OTHER INFORMATION: PS 9 : polymorphic base thymine or cytosine 
FEATURE : 

NAME/KEY: allele 
LOCATION: (4882) . . (4882) 

OTHER INFORMATION: PS10: polymorphic base adenine or cytosine 
FEATURE : 
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NAME/KEY: allele 
LOCATION: (4976).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (5600) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (5820).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (6536).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (6607).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (6617).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (6872) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/ KEY : allele 
LOCATION: (6966).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (7036).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (7086) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (8100) . . 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (8221).. 
OTHER INFORMATION: 
FEATURE : 

NAME/KEY: allele 
LOCATION: (8677).. 
OTHER INFORMATION: 
US-10-197-019-1 



(4976) 

PS11: polymorphic base thymine or adenine 



(5600) 

PS12: polymorphic base cytosine or thymine 



(5820) 

PS13: polymorphic base thymine or guanine 



(6536) 

PS14: polymorphic base thymine or adenine 



(6607) 

PS15: polymorphic base guanine or adenine 



(6617) 

PS16: polymorphic base cytosine or thymine 



(6872) 

PS17: polymorphic base cytosine or guanine 
(6966) 

PS18: polymorphic base guanine or adenine 
(7036) 

PS19: polymorphic base cytosine or thymine 
(7086) 

PS20: polymorphic base adenine or guanine 



(8100) 

PS21: polymorphic base cytosine or thymine 



(8221) 

PS22: polymorphic base guanine or adenine 



(8677) 

PS23: polymorphic base thymine or adenine 



Query Match 94.2%; 
Best Local Similarity 98.4%; 
Matches 627; Conservative 



Score 59 9.8; DB 7; 
Pred. No. 5.7e-164; 
0; Mismatches 7; 



Length 9314; 
Indels 3; 



Gaps 



2; 



Qy 



Db 



1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
248 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 307 
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Qy 61 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 120 

I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I 
Db 308 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 367 

Qy 121 T C GCAGGAGGGT GGGT AGT T TGCC C AGC GT AGGGGGGC TGGGC C CATAAAAGAGGAAGTG 180 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 368 T CGCAGGAGGGT GGG T AGT T TGCC CAGC GT A- GGGGGC TGGGC C CATAAAAGAGGAAGTG 426 

Qy 181 CAC TT AAGAC AC GGC CC CGC TGGACGC T T GT T AGAAACCG TCC T GGC T GGGAAGGC AAGA 240 

I I I I II I I I I I I I I I I I I I I I II I II I I I I I I II I I I I I I I I I I II I I I I I 

Db 427 CAC TTAAGACACGGCCCAG — TGGACGCTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGA 484 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 485 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 544 

Qy 301 GC GGC C CGAGAGAGT GTGAGGCAGAGGC GGGG AGTGGC AAGGGAGTGAC CAT C TCGGGGA 360 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 545 GC GGC CCGAGAGAGT GTGAGGCAGAGGC GGGG AGTGGC AAGGGAGTGAC CAT C TCGGGGA 604 

Qy 361 AC GAAGGAGTAAACGCGGT GATGGGAC GC AC GGAAACGGGAGT GGAGAAAGT C ATGGAGA 420 

I I II II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 605 AC GAAGGAGTAAAC GCGGT GATGGGAC GC AC GGAAACGGGAGTGGAGAAAGT C ATGGAGA 664 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 665 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 724 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

II I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I II I I I I II I I I I I I I I I I I I I I I I 

Db 725 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 784 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I II I I I I I I I I I I I M I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 785 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 844 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 845 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 881 



RESULT 2 

US-10-265-689-27 

Sequence 27, Application US/102 65689 
Publication No. US20030119775A1 
GENERAL INFORMATION: 
APPLICANT: SURWIT, RICHARD S. 
APPLICANT: COLLINS, SHEILA A. 
APPLICANT: WARDEN, CRAIG H. 
APPLICANT: SELDIN, MICHAEL F. 
APPLICANT: RICQUIER, DANIEL 
APPLICANT: BOUILLAUD, FREDERIC 

TITLE OF INVENTION: RESPIRATION UNCOUPLING PROTEIN 
FILE REFERENCE: 157 9-37 6 

CURRENT APPLICATION NUMBER: US/10/265,68 9 
CURRENT FILING DATE: 2002-10-08 
PRIOR APPLICATION NUMBER: US/09/353,645 
PRIOR FILING DATE: 1999-07-15 
PRIOR APPLICATION NUMBER: PCT/US97/06864 
PRIOR FILING DATE: 1997-04-22 
PRIOR APPLICATION NUMBER: 60/034,960 
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PRIOR FILING DATE: 1997-01-15 
NUMBER OF SEQ ID NOS : 47 
SOFTWARE: Patentln Ver . 2.1 
SEQ ID NO 27 
LENGTH: 1161 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

OTHER INFORMATION: "n" bases may be a, t, c, g, modified or unknown 
US-10-265-689-27 

Query Match 70.2%; Score 447.4; DB 7; Length 1161; 

Best Local Similarity 94.9%; Pred. No. 9.9e-120; 

Matches 516; Conservative 0; Mismatches 22; Indels 6; Gaps 5; 

Qy 100 AGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGT — AGGGGGG 157 

I II III I I I I I I III I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I 
Db 2 ANGAACCACCGGCGCGTTCGTTCGCAGGAGGTTGGTTAGTTTGCCCAGGGGTAAGGGGGG 61 

Qy 158 C TGGGC C C ATAAAAGAGGAAGTGC - AC T TAAGAC AC GGCCC C GC TGGACGC TTG T T AGAA 216 

I I I I I i I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 62 C TGGGC C CAT AAAAGAGGAAGTGCAACT TAAGAC AC GGCC CCCGTTGGACGCTGTT AGAA 121 

Qy 217 ACCGTCCT-GGCTGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCT-GGCGGT 274 

III Mil II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 122 ACCTTCCTGGGTTGGGAAGGCAAGAGGTGTGTGACTGGACAAGAATTGTTTCTGGGCGGT 181 

Qy 275 CAGTCT TGCCATCCTCACAGAGGT TGGC GGCCCGAGAGAGTGTGAGGC AGAGGCGGGGAG 334 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 C AGTC T T GC CAT C C T CACAGAGGT TGGC GGCC CGAGAGAG TGTG AGGC AGAGGC GGGGAG 241 

Qy 335 TGGCAAGGGAG TGAC CATC TC GGGGAAC GAAGGAGT AAAC GC GG TGAT GGGAC GCAC GG- 393 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 242 TGGCAAGGGAG T GAC CATC T C GGGGAAC GAAGGAGT AAAC GC GG T GAT GGGACGCACGGA 301 

Qy 394 AAAC GGGAGTGGAGAAAGT C ATGGAGAGAAC C C T AGGCGGGGCGGTC C C C GC GGAAAGGC 453 

I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 302 AAAC GGGAGTGGAGAAAGT C ATGGAGAGAAC C T T AGGCGGGGC GGTC C C C GCGGAAAGGC 361 

Qy 454 GGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGG 513 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 362 GGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGG 421 

Qy 514 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 573 

I I I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 422 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 481 

Qy 574 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 633 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 482 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 541 

Qy 634 CGCG 637 

I I I I 

Db 542 CGCG 54 5 



RESULT 3 

US-10-311-455-1865 

; Sequence 1865, Application US/10311455 
; Publication No. US20030143606A1 
; GENERAL INFORMATION: 
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APPLICANT: OLEK, Alexander 
APPLICANT: PIEPENBROCK, Christian 
APPLICANT: BERLIN, Kurt 

TITLE OF INVENTION: Diagnosis of Diseases Associated with the Immune System by Dete 
TITLE OF INVENTION: cytosine methylation 
FILE REFERENCE: 5013.1014 

CURRENT APPLICATION NUMBER: US/10/311,4 55 
CURRENT FILING DATE: 2002-12-16 
PRIOR APPLICATION NUMBER: PCT/EP01/07537 
PRIOR FILING DATE: 2001-07-02 
PRIOR APPLICATION NUMBER: DE 10032529.7 
PRIOR FILING DATE: 2000-06-30 
PRIOR APPLICATION NUMBER: DE 10043826.1 
PRIOR FILING DATE: 2000-09-01 
NUMBER OF SEQ ID NOS : 2424 
SEQ ID NO 1865 
LENGTH: 5283 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: chemically treated genomic DNA (Homo sapiens) 
US-10-311-455-1865 

Query Match 57.2%; Score 364.2; DB 7; Length 5283; 

Best Local Similarity 77.4%; Pred. No. 1.6e-95; 

Matches 4 67; Conservative 0; Mismatches 133; Indels 3; Gaps 2; 

Qy 35 C GAC AAC GC GGC GGC TGTAAC CAATCGAC AGC GAGGCCGG TCGC GAGGC CC C AGTC C CGC 94 

III I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I III II 

Db 2 CGATAACGCGGCGGTTGTAATTAATCGATAGCGAGGTCGGTCGCGAGGTTTTAGTTTCGT 61 

Qy 95 C C TGC AGGAGC C AGC C GC GCGC TC GC TC GC AGGAGGGTGGGTAG TT T GC CC AGC GT AGGG 154 

II I I I I I II I I I I I I III III I I I I II I I I I I I I I I I I I II I I II II 
Db 62 T T TGT AGGAGT TAGT C GC GC GT T C GT TC GTAGGAGGGTGGGT AG TT T GT T TAGC GT A- GG 120 

Qy 155 GGGC T GGGC CC ATAAAAGAGGAAGTGCAC T TAAGAC ACGGCC CC GC TGGACGC T TGT TAG 214 

III I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 GGGT TGGGT T T ATAAAAGAGGAAGTGTAT T TAAGAT ACGG T T TAG — TGGAC GT TG T TAG 178 

Qy 215 AAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCTGGCGGT 274 

III III III I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 AAATCGTTTTGGTTGGGAAGGTAAGAGGTGTGTGATTGGATAAGATTTGTTTTTGGCGGT 238 

Qy 275 C AGT C T T GCC ATCC T C AC AGAGGT TGGC GGCC CGAGAGAGTGTGAGGC AGAGGC GGGGAG 334 

III II I II I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I 
Db 239 TAGTTTT GTTATTT T TATAGAGGTTGGCGGTT CGAGAGAGTGTGAGGT AGAGGC GGGGAG 298 

Qy 335 TGGCAAGGGAGTGAC CATC TCGGGGAAC GAAGGAGTAAACGCGGTGATGGGACGCACGGA 394 

III I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 299 TGGTAAGGGAGTGAT TAT T TCGGGGAAC GAAGGAGTAAACGCGGTGATGGGACGTACGGA 358 

Qy 395 AACGGGAGTGGAGAAAGTCATGGAGAGAACCC TAGGCGGGGCGGTCCC CGCGGAAAGGCG 454 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I 
Db 359 AACGGGAGT GGAGAAAGT T ATGGAGAGAAT TT T AGGCGGGGCGG T T T T C GC GGAAAGGC G 418 

Qy 455 GCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGC 514 

I I I I I I I I I I I I I I I II I I I I I I I I I I I III II I I I I I 
Db 419 GTTGTTTTAGGGTTTTCGTATTTAAGTAGGAGTTGGTAGGTTCGGTTTCGTTTCGTAGGT 478 

Qy 515 CCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACT 574 

I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I II 
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Db 47 9 TTTATTTCGGGTTTCGTTTTCGAGGTTTAAGTCGCGTCGTCGTTTGCGCGGAGTTTTATT 538 

Qy 575 GC GAAGC CC AGC T GC GC GC GCC T T GGGAT T GAC T GT CC AC GC TC GC CC GGC T C GTCC GAC 634 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 539 GCGAAGTTTAGTTGCGCGCGTTTTGGGATTGATTGTTTACGTTCGTTCGGTTCGTTCGAC 598 

Qy 635 GCG 637 

I I I 

Db 599 GCG 601 



RESULT 4 

US-10-311-455-1866/C 

Sequence 1866, Application US/10311455 
Publication No. US20030143606A1 
GENERAL INFORMATION: 
APPLICANT: OLEK, Alexander 
APPLICANT: PIEPENBROCK, Christian 
APPLICANT: BERLIN, Kurt 

TITLE OF INVENTION: Diagnosis of Diseases Associated with the Immune System by Dete 
TITLE OF INVENTION: cytosine methylation 
FILE REFERENCE: 5013.1014 

CURRENT APPLICATION NUMBER: US/10/311,4 55 
CURRENT FILING DATE: 2002-12-16 
PRIOR APPLICATION NUMBER: PCT/EP01/07537 
PRIOR FILING DATE: 2001-07-02 
PRIOR APPLICATION NUMBER: DE 10032529.7 
PRIOR FILING DATE: 2000-06-30 
PRIOR APPLICATION NUMBER: DE 10043826.1 
PRIOR FILING DATE: 2000-09-01 
NUMBER OF SEQ ID NOS : 2424 
SEQ ID NO 1866 
LENGTH: 5283 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: chemically treated genomic DNA (Homo sapiens) 
US-10-311-455-1866 

Query Match 49.0%; Score 312.4; DB 7; Length 5283; 

Best Local Similarity 72.0%; Pred. No. 1.8e-80; 

Matches 435; Conservative 0; Mismatches 166; Indels 3; Gaps 2; 

Qy 34 CCGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCG 93 

I I I I I I I I I I I II II I I I I I I I I I I I I I III III I II I I I I I II I I I I I I 
Db 5283 C CG AC AACGCGACGAC TAT AACCAATCGACAAC GAAAC C GATC GCGAAAC C C C AAT C C C G 5224 

Qy 94 CCCTGCAGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGG 153 

I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 5223 CCCTACAAAAACCAACCGCGCGCTCGCTCGCAAAAAAATAAATAATTTACCCAACGTA-A 5165 

Qy 154 GGGGC T GGGC C CATAAAAGAGGAAGTGC AC T T AAGAC AC GGC C C CGC T GGAC GC T TGT T A 213 

II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 5164 AAAAC T AAAC C C AT AAAAAAAAAAAT AC AC T T AAAAC AC GAC CC — AAT AAACGC TAT T A 5107 

Qy 214 GAAAC C G T CC T GGC T GGGAAGGCAAGAGGTGT GTGAC TGGACAAGAC T TGT T TC TGGCGG 273 

I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I II 
Db 5106 AAAAC CGT CC T AAC T AAAAAAACAAAAAAT AT ATAAC TAAACAAAAC T TAT T TC TAACGA 5047 

Qy 274 TC AGTCT TGCC ATCC TCAC AGAGGT TGGCGGC CCGAGAGAGTGT GAGGCAGAGGCGGGGA 333 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I III II I 
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Db 504 6 TCAATC T TAC C ATCC TC AC AAAAAT TAACGAC CC GAAAAAAT AT AAAACAAAAACGAAAA 4987 

Qy 334 GTGGCAAGGGAGTGACC AT C TCGGGGAACGAAGGAGTAAACGCGGTGATGGGACGC ACGG 393 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I 
Db 4986 AT AAC AAAAAAAT AAC CAT C T C GAAAAAC GAAAAAAT AAAC GCG AT AAT AAAAC GC ACGA 4927 

Qy 394 AAACGGGAGTGGAGAAAGTCATGGAGAGAACC C TAGGCGGGGCGGTCCCCGCGGAAAGGC 453 

II I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I 
Db 4926 AAAC GAAAAT AAAAAAAAT C AT AAAAAAAAC C C T AAAC GAAAC GAT C C C C GC G AAAAAAC 4867 

Qy 454 GGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGG 513 

I I I I I I II I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I 
Db 4866 GAC TAC T C CAAAATC TCCGC ACC C AAAT AAAAAC TAAC AAACCC GACC CCGC CC C GCAAA 4807 

Qy 514 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 573 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4806 CCCCACCCCGAACCCCGCCCCCGAAACTTAAACCGCGCCGCCGCCTACGCGAAACCCCAC 4747 

Qy 574 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 633 

I I I I I I I I I II I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I 
Db 4746 TACGAAACCCAACTACGCGCGCCTTAAAATTAACTATCCACGCTCGCCCGACTCGTCCGA 4687 

Qy 634 CGCG 637 

I I I I 

Db 4686 CGCG 4683 



RESULT 5 

US-10-159-563-344 

; Sequence 344, Application US/10159563 

; Publication No. US20040009154A1 

; GENERAL INFORMATION: 

; APPLICANT: Khan, Javed 

; APPLICANT: Ringner, Markus 

; APPLICANT: Peterson, Carsten 

; APPLICANT: Meltzer, Paul 

; TITLE OF INVENTION: SELECTIONS OF GENES AND METHODS OF USING THE SAME FOR 

; TITLE OF INVENTION: DIAGNOSIS AND FOR TARGETING THE THERAPY OF SELECT CANCERS 

; FILE REFERENCE: 11613. 56USI1 

; CURRENT APPLICATION NUMBER: US/10/159,563 

; CURRENT FILING DATE: 2002-12-09 

; PRIOR APPLICATION NUMBER: US 10/133,937 

; PRIOR FILING DATE: 2002-04-25 

; NUMBER OF SEQ ID NOS : 444 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 344 
LENGTH: 164 6 
TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-159-563-344 

Query Match 10.5%; Score 67; DB 7; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 3.3e-09; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 

Qy 631 CGACGCG 637 

I I I I I I I 
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Db 61 CGACGCG 67 



RESULT 6 

US-10-287-436A-43 

; Sequence 43, Application US/10287436A 
; Publication No. US20050202421A1 
; GENERAL INFORMATION: 

; APPLICANT: CHILDREN'S HOSPITAL MEDICAL CENTER 

; TITLE OF INVENTION: METHOD FOR DIAGNOSIS AND TREATMENT OF 

; TITLE OF INVENTION: RHEUMATOID ARTHRITIS 

; FILE REFERENCE: 10872.514696 

; CURRENT APPLICATION NUMBER: US/10/287 , 4 36A 

; CURRENT FILING DATE: 2002-10-31 

; PRIOR APPLICATION NUMBER: US 60/336,220 

; PRIOR FILING DATE: 2001-10-31 

; NUMBER OF SEQ ID NOS : 1446 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 43 

LENGTH: 164 6 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-287-436A-43 



Query Match - 10.5%; Score 67; DB 10; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 3.3e-09; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I i I I I I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 



Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 7 

US-10-631-467-426 

; Sequence 426, Application US/10631467 
; Publication No. US20050208496A1 
; GENERAL INFORMATION: 

APPLICANT: Genox Research Inc. 
; TITLE OF INVENTION: Method for testing for broncheal asthma, or chronic obstructive 
; TITLE OF INVENTION: disease 
; FILE REFERENCE: 34 62.1005-000 
; CURRENT APPLICATION NUMBER: US/10/631,4 67 
; CURRENT FILING DATE: 2003-07-31 
; PRIOR APPLICATION NUMBER: JP 2003-077212 
; PRIOR FILING DATE: 2003-03-20 
; PRIOR APPLICATION NUMBER: JP 2002-229312 
; PRIOR FILING DATE: 2002-08-06 
; NUMBER OF SEQ ID NOS: 2086 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 426 

LENGTH: 164 6 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-631-467-426 

Query Match 10.5%; Score 67; DB 10; Length 1646; 
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Best Local Similarity 100.0%; Pred. No. 3.3e-09; 



Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 C AC TGC GAAGC C C AGC TGC GCGC GC C T T GGGAT TGAC T GTCCACGC TC GC C C GGC T CGTC 630 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 

Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 8 

US-11-169-041-97 

; Sequence 97, Application US/11169041 
; Publication No. US20060019284A1 
; GENERAL INFORMATION: 

; APPLICANT: Bristol-Myers Squibb Company 

; TITLE OF INVENTION: IDENTIFICATION OF POLYNUCLEOTIDES FOR PREDICTING ACTIVITY OF 
; TITLE OF INVENTION: COMPOUNDS THAT INTERACT WITH AND/OR MODULATE PROTEIN TYROSINE 
; TITLE OF INVENTION: KINASES AND /OR PROTEIN TYROSINE KINASE PATHWAYS IN LUNG CANCER 
; TITLE OF INVENTION: CELLS 
; FILE REFERENCE: 10001 NP 

; CURRENT APPLICATION NUMBER: US/11/169,041 

; CURRENT FILING DATE: 2005-06-28 

; PRIOR APPLICATION NUMBER: 60/584,405 

; PRIOR FILING DATE: 2004-06-30 

; NUMBER OF SEQ ID NOS : 527 

SOFTWARE: Patentln version 3.2 
; SEQ ID NO 97 

LENGTH: 164 6 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-11-169-041-97 

Query Match 10.5%; Score 67; DB 16; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 3.3e-09; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 

Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 9 

US-10-287-436A-673 

; Sequence 673, Application US/10287436A 

; Publication No. US20050202421A1 

; GENERAL INFORMATION: 

; APPLICANT: CHILDREN'S HOSPITAL MEDICAL CENTER 

; TITLE OF INVENTION: METHOD FOR DIAGNOSIS AND TREATMENT OF 

; TITLE OF INVENTION: RHEUMATOID ARTHRITIS 

; FILE REFERENCE: 10872.514696 

; CURRENT APPLICATION NUMBER: US/10/287 , 4 36A 

; CURRENT FILING DATE: 2002-10-31 

; PRIOR APPLICATION NUMBER: US 60/336,220 

; PRIOR FILING DATE: 2001-10-31 
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NUMBER OF SEQ ID NOS : 1446 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 673 
LENGTH: 8177 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-287-436A-673 

Query Match 10.5%; Score 67; DB 10; Length 8177; 

Best Local Similarity 100.0%; Pred. No. 3.5e-09; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CAC T GCGAAGC CCAGC TGC GCGC GC C TT GGGAT TGAC TGTC C AC GC T C GC CC GGC T CGT C 630 

I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I! I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 

Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 10 

US-10-301-480-268288/C 

; Sequence 268288, Application US/10301480 

; Publication No. US20060057564A1 

; GENERAL INFORMATION: 

; APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identif iction and Mapping of Single Nucleotide Polymorphisms 
; TITLE OF INVENTION: in the Human Genome 
; FILE REFERENCE: 108827.137 

; CURRENT APPLICATION NUMBER: US/10/301,480 

; CURRENT FILING DATE: 2002-11-21 

; PRIOR APPLICATION NUMBER: US 10/215,598 

; PRIOR FILING DATE: 2002-08-09 

; PRIOR APPLICATION NUMBER: US 60/311,695 

; PRIOR FILING DATE: 2001-08-10 

; NUMBER OF SEQ ID NOS: 1226818 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 2 68288 

LENGTH: 561 

TYPE: DNA 
; ORGANISM: Homo sapien 
US-10-301-480-268288 



Query Match 9.4%; Score 60; DB 12; Length 561; 

Best Local Similarity 100.0%; Pred. No. 3.4e-07; 

Matches 60; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 60 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 1 



RESULT 11 

US-10-301-480-881697/C 

; Sequence 881697, Application US/10301480 

; Publication No. US20060057564A1 

; GENERAL INFORMATION: 

; APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identif iction and Mapping of Single Nucleotide Polymorphisms 
; TITLE OF INVENTION: in the Human Genome 
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; FILE REFERENCE: 108827.137 

; CURRENT APPLICATION NUMBER: US/10/301,480 

; CURRENT FILING DATE: 2002-11-21 

; PRIOR APPLICATION NUMBER: US 10/215,598 

; PRIOR FILING DATE: 2002-08-09 

; PRIOR APPLICATION NUMBER: US 60/311,695 

; PRIOR FILING DATE: 2001-08-10 

; NUMBER OF SEQ ID NOS : 1226818 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 881697 

LENGTH: 561 

TYPE: DNA 
; ORGANISM: Homo sapien 
US-10-301-480-881697 

Query Match 9.4%; Score 60; DB 12; Length 561; 

Best Local Similarity 100.0%; Pred. No. 3.4e-07; 

Matches 60; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 60 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 1 



RESULT 12 

US-10-062-674-1405 

; Sequence 1405, Application US/10062674 
; Publication No. US20040005559A1 
; GENERAL INFORMATION: 

; APPLICANT: Loring, Jeanne F.; Kaser, Matthew R. 

; TITLE OF INVENTION: MARKERS OF NEURONAL DIFFERENTIATION AND MORPHOGENESIS 

; FILE REFERENCE: PA-002 6-1 CTP 

; CURRENT APPLICATION NUMBER: US/10/062,674 

; CURRENT FILING DATE: 2002-01-30 

; PRIOR APPLICATION NUMBER: US 09/625,102 

; PRIOR FILING DATE: 2000-07-24 

; NUMBER OF SEQ ID NOS: 2217 

; SOFTWARE: PERL Program 

; SEQ ID NO 1405 

LENGTH: 1725 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE : 
; NAME/KEY: misc_feature 

OTHER INFORMATION: Incyte ID No. US20040005559A1 093687.7 

FEATURE : 

NAME/KEY: unsure 
LOCATION: (1) ... (1725) 

OTHER INFORMATION: a, t, c, g, or other 
US-10-062-674-1405 

Query Match 9.3%; Score 59; DB 7; Length 1725; 

Best Local Similarity 100.0%; Pred. No. 7e-07; 

Matches 59; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 579 AGC CC AGC T GC GCGC GC C T TGGGAT TGAC TGT CC AC GC TC GCC C GGC T CGTC C GACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 AGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 59 



RESULT 13 
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US-10-240-965-183 

Sequence 183, Application US/10240965 
Publication No. US20030165924A1 
GENERAL INFORMATION: 
APPLICANT: INCYTE GENOMICS , INC. 
APPLICANT: SHIFFMAN, Dov 
APPLICANT: SOMOGYI, Roland 
APPLICANT: LAWN, Richard M. 
APPLICANT: SEILHAMER, Jeffrey J. 
APPLICANT: PORTER, Gordon J. 
APPLICANT: MIKITA, Thomas 
APPLICANT : TAI , Julie 

TITLE OF INVENTION: GENES EXPRESSED IN FOAM CELL DIFFERENTIATION 
FILE REFERENCE: PA-0025 PCT 

CURRENT APPLICATION NUMBER: US/10/240,965 
CURRENT FILING DATE: 2002-10-04 
PRIOR APPLICATION NUMBER: 60/195,106 
PRIOR FILING DATE: 2000-04-05 
NUMBER OF SEQ ID NOS : 27 6 
SOFTWARE: PERL Program 
SEQ ID NO 183 
LENGTH: 1643 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY : misc_f eatur e 

OTHER INFORMATION: Incyte ID No. US20030165924A1 093687.6 
US-10-240-965-183 

Query Match 9.1%; Score 58; DB 7; Length 164 3; 

Best Local Similarity 100.0%; Pred. No. 1.4e-06; 

Matches 58; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 580 GCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I 
Db 1 GCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 58 



RESULT 14 

US-09-925-065A-177131/C 

; Sequence 177131, Application US/09925065A 

; Publication No. US20040181048A1 

; GENERAL INFORMATION: 

; APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identification and Mapping of Single 

TITLE OF INVENTION: Nucleotide Polymorphisms in the Human Genome 

; FILE REFERENCE: 108827.135 

; CURRENT APPLICATION NUMBER: US/09/925, 065A 

; CURRENT FILING DATE: 2001-08-08 

; PRIOR APPLICATION NUMBER: US 60/243,096 

; PRIOR FILING DATE: 2000-10-24 

; PRIOR APPLICATION NUMBER: US 60/252,147 

; PRIOR FILING DATE: 2000-11-20 

; PRIOR APPLICATION NUMBER: US 60/250,092 

; PRIOR FILING DATE: 2000-11-30 

; PRIOR APPLICATION NUMBER: US 60/261,766 

; PRIOR FILING DATE: 2001-01-16 

; PRIOR APPLICATION NUMBER: US 60/289,846 

; PRIOR FILING DATE: 2001-05-09 

; NUMBER OF SEQ ID NOS: 957086 

SOFTWARE: FastSEQ for Windows Version 4.0 
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; SEQ ID NO 177131 
LENGTH: 554 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-925-065A-177131 

Query Match 8.3%; Score 53; DB 4; Length 554; 

Best Local Similarity 100.0%; Pred. No. 3.7e-05; 

Matches 53; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTA 53 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 53 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTA 1 



RESULT 15 

US-09-925-065A-177131/C 

; Sequence 177131, Application US/09925065A 

; Publication No. US20050228172A9 

; GENERAL INFORMATION: 

; APPLICANT: Wang, David G. 

; TITLE OF INVENTION: Identification and Mapping of Single 

; TITLE OF INVENTION: Nucleotide Polymorphisms in the Human Genome 

; FILE REFERENCE: 108827.135 

; CURRENT APPLICATION NUMBER: US/09/925 , 065A 

; CURRENT FILING DATE: 2001-08-08 

; PRIOR APPLICATION NUMBER: US 60/243,096 

; PRIOR FILING DATE: 2000-10-24 

; PRIOR APPLICATION NUMBER: US 60/252,147 

; PRIOR FILING DATE: 2000-11-20 

; PRIOR APPLICATION NUMBER: US 60/250,092 

; PRIOR FILING DATE: 2000-11-30 

; PRIOR APPLICATION NUMBER: US 60/261,766 

; PRIOR FILING DATE: 2001-01-16 

; PRIOR APPLICATION NUMBER: US 60/289,846 

; PRIOR FILING DATE: 2001-05-09 

; NUMBER OF SEQ ID NOS : 957086 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 177131 

LENGTH: 554 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-925-065A-177131 

Query Match 8.3%; Score 53; DB 5; Length 554; 

Best Local Similarity 100.0%; Pred. No. 3.7e-05; 

Matches 53; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTA 53 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 53 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTA 1 



Search completed: June 6, 2006, 22:54:14 
Job time : 1203 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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SCORE Search Results Details for Application 
09869098 and Search Result us-09-869-098a 
l__copy__1634__2270.rni- 



Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments / Sugg 

This page gives you Search Results detail for the Application 09869098 and Search Result us-09-8( 
l_copy_1634_2270.rni. 

start 

Go Back to pre' 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



June 6, 2006, 20:46:25 ; Search time 176 Seconds 

(without alignments) 
6772.138 Million cell updates/sec 

US-09-869-098A-l_COPY_1634_2270 
637 

1 acctgttgctgcagttctga cgcccggctcgtccgacgcg 637 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 1403666 seqs, 935554401 residues 

Total number of hits satisfying chosen parameters: 



2807332 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



Issued Patents NA:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC_Celerra_SIDS3 
/EMC Celerra SIDS 



/ptodata/2 /ina/l_COMB . seq : * 
/ptodata/2 /ina/5_COMB . seq : * 
/ptodata/2/ina/6A_COMB . seq : * 
/ptodata/2 /ina/6B_COMB . seq : * 
/ptodata/2/ina/7_COMB. seq: * 
/ptodata/2 /ina/H_COMB . seq : * 
/ptodata/2/ina/PCTUS_COMB.seq:* 
/ptodata/2 /ina/PP_COMB . seq : * 
/ptodata/2/ina/RE_COMB. seq: * 
3/ptodata/2/ina/backf ilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 
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Result Query 

No. Score Match Length DB ID Description 





1 


625 


98. 


1 


11808 


3 


US- 


09- 


949- 


016-15281 


Sequence 


15281, A 




2 


625 


98. 


1 


39754 


3 


us- 


09- 


949- 


016-14689 


Sequence 


14689, A 




3 


485 


76. 


1 


736 


2 


us- 


08- 


846- 


012A-1 


Sequence 


1 , Appli 




4 


485 


76. 


1 


736 


2 


us- 


09- 


100- 


297-1 


Sequence 


1, Appli 




5 


92.4 


14. 


5 


2705 


3 


us- 


09- 


949- 


016-2947 


Sequence 


2947 


, Ap 


c 


6 


60 


9. 


4 


7218 


2 


us- 


08- 


232- 


463-14 


Sequence 


14, 


Appl 




7 


57 


8. 


9 


161 


3 


us- 


09- 


513- 


999C-14461 


Sequence 


14461, A 


c 


8 


51 


8. 


0 


1401 


4 


us- 


09- 


605- 


703B-543 


Sequence 


543, 


App 




9 


48.4 


7. 


6 


69834 


3 


us- 


09- 


949- 


016-11959 


Sequence 


11959, A 




10 


48.4 


7. 


6 


69834 


3 


us- 


09- 


949- 


016-12925 


Sequence 


12925, A 


c 


11 


45.4 


7. 


1 


70014 


3 


us- 


09- 


949- 


016-17110 


Sequence 


17110, A 




12 


44.8 


7. 


0 


27270 


3 


us- 


09- 


949- 


016-13822 


Sequence 


13822, A 




13 


44.6 


7. 


0 


154746 


3 


us- 


09- 


827- 


688-8 


Sequence 


8 , Appli 


c 


14 


44.6 


7. 


0 


154746 


3 


us- 


09- 


827- 


688-8 


Sequence 


8, Appli 


c 


15 


44.4 


7. 


0 


670689 


3 


us- 


09- 


949- 


016-12505 


Sequence 


12505, A 


c 


16 


44.4 


7. 


0 


670690 


3 


us- 


09- 


949- 


016-14207 


Sequence 


14207, A 


c 


17 


44 


6. 


9 


63183 


3 


us- 


09- 


949- 


016-13047 


Sequence 


13047, A 


c 


18 


44 


6. 


9 


63183 


3 


us- 


09- 


949- 


016-13048 


Sequence 


13048, A 


c 


19 


43.8 


6. 


9 


43058 


4 


us- 


09- 


880- 


107-3950 


Sequence 


3950 


, Ap 


c 


20 


43.2 


6. 


8 


2592 


3 


us- 


09- 


949- 


016-2162 


Sequence 


2162 


, Ap 




21 


43.2 


6. 


8 


29930 


3 


us- 


09- 


949- 


016-15326 


Sequence 


15326, A 


c 


22 


43.2 


6. 


8 


43690 


3 


us- 


09- 


949- 


016-13904 


Sequence 


13904, A 




23 


43 


6. 


8 


9293 


3 


us- 


09- 


949- 


016-16801 


Sequence 


16801, A 




24 


42.8 


6. 


7 


2561 


3 


us- 


09- 


616- 


289-48 


Sequence 


48, 


Appl 




25 


42.8 


6. 


7 


2561 


3 


us- 


09- 


976- 


740-48 


Sequence 


48, 


Appl 


c 


26 


41.8 


6. 


6 


4563 


3 


us- 


10- 


012- 


231A-259 


Sequence 


259, 


App 


c 


27 


41.8 


6. 


6 


4563 


3 


us- 


10- 


015- 


389A-259 


Sequence 


259, 


App 


c 


28 


41.8 


6. 


6 


4563 


3 


us- 


10- 


006- 


768A-259 


Sequence 


259, 


App 


c 


29 


41.8 


6. 


6 


4563 


3 


us- 


10- 


015- 


671A-259 


Sequence 


259, 


App 


c 


30 


41.8 


6. 


6 


4563 


3 


us- 


10- 


015- 


393A-259 


Sequence 


259, 


App 


c 


31 


41.8 


6. 


6 


4563 


3 


us- 


10- 


011- 


833A-259 


Sequence 


259, 


App 


c 


32 


41.8 


6. 


6 


4563 


3 


us- 


10- 


006- 


041A-259 


Sequence 


259, 


App 


c 


33 


41.8 


6. 


6 


4563 


3 


us- 


10- 


012- 


064A-259 


Sequence 


259, 


App 


c 


34 


41.8 


6. 


6 


4563 


4 


us- 


10- 


015- 


392A-259 


Sequence 


259, 


App 


c 


35 


41.8 


6. 


6 


4563 


5 


us- 


10- 


011- 


795B-259 


Sequence 


259, 


App 


c 


36 


41.8 


6. 


6 


4563 


5 


us- 


10- 


015- 


386A-259 


Sequence 


259, 


App 


c 


37 


41.8 


6. 


6 


4563 


5 


us- 


10- 


012- 


121A-259 


Sequence 


259, 


App 


c 


38 


41.8 


6. 


6 


4563 


5 


us- 


10- 


006- 


485A-259 


Sequence 


259, 


App 


c 


39 


41.8 


6. 


6 


4563 


5 


us- 


10- 


006- 


746A-259 


Sequence 


259, 


App 


c 


40 


41.8 


6. 


6 


4563 


5 


us- 


10- 


012- 


752A-259 


Sequence 


259, 


App 


c 


41 


41.8 


6. 


6 


4563 


5 


us- 


10- 


017- 


253A-259 


Sequence 


259, 


App 


c 


42 


41.8 


6. 


6 


4563 


5 


us- 


10- 


015- 


519A-259 


Sequence 


259, 


App 


c 


43 


41.8 


6. 


6 


4563 


5 


us- 


10- 


015- 


715A-259 


Sequence 


259, 


App 


c 


44 


41.8 


6. 


6 


4563 


5 


us- 


10- 


007- 


236A-259 


Sequence 


259, 


App 




45 


41.8 


6. 


6 


19152 


3 


us- 


09- 


949- 


016-12110 


Sequence 


12110, A 



ALIGNMENTS 



RESULT 1 

US-09-949-016-15281 

; Sequence 15281, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al. 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
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FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/94 9,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 15281 
LENGTH: 11808 
TYPE: DNA 
ORGANISM: Human 
US-09-949-016-15281 



Query Match 98;i%; 
Best Local Similarity 99.8%; 
Matches 63 6; Conservative 



Score 625; DB 3; Length 11808; 
Pred. No. 1.8e-131; 
0; Mismatches 0; Indels 1; 



Gaps 



1; 



Qy 



Db 



1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
1402 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 14 61 



Qy 

Db 

Qy 

Db 

Qy 

Db 



61 GAC AGC GAGGC CGGT C GCGAGGC C CCAG TC CC GC C C T GC AGGAGCC AGC CGC GC GC T CGC 120 

I I I II I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
14 62 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 1521 



121 



180 



T C GC AGGAGGG T GGGTAGT T T GCC C AGC GT AGGGGGGC TGGGCC C ATAAAAGAGGAAGT G 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1522 TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTG 1581 

181 C AC T TAAGAC AC GGC CCCGC TGGAC GC T TGTT AGAAACCGTCC T GGC T GGGAAGGC AAGA 240 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
1582 C AC T TAAGAC AC GGC CCCGC TGGAC GC - T GTT AGAAAC CGT CC T GGC TGGGAAGGCAAGA 1640 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 
I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1641 GGTGTGTGAC TGGAC AAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 1700 



301 



360 



GCGGC C CGAGAGAGT GTGAGGCAGAGGC GGGGAGTGGC AAGGGAGTGAC C AT CTCGGGGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I 
1701 GC GGC C C GAGAGAGT GTGAGGCAGAGGC GGGG AGTGGCAAGGGAGT GAC CAT CTCGGGGA 17 60 



361 



420 



ACGAAGGAGTAAACGCGGTGATGGGACGCACGGAAACGGGAGTGGAGAAAGTCATGGAGA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
17 61 AC GAAGGAGTAAACGCGGT GATGGGACGC ACGGAAACGGGAGTGGAGAAAGT C ATGGAGA 1820 



421 



480 



GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGC TGCTCCAGGGTC TCCGCACCCAAG 

I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
1821 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGC TGCTCCAGGGTC TCCGCACCCAAG 1880 



481 



540 



TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 

I I I II I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I II I I I I I II I I I I I I I I II I 
1881 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 1940 



541 



600 



TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
1941 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 2000 
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Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 2001 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 2037 



RESULT 2 

US-09-949-016-14689 

; Sequence 14689, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al. 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/94 9,016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 14689 

LENGTH: 3 9754 

TYPE: DNA 

ORGANISM: Human 

FEATURE : 

NAME/KEY: misc_feature 
LOCATION: ( 1 )...( 39754 ) 
OTHER INFORMATION: n = A, T,C or G 
US-09-949-016-14689 

Query Match 98.1%; Score 625; DB 3; Length 39754; 

Best Local Similarity 99.8%; Pred. No. 2.1e-131; 

Matches 63 6; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 



Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 29317 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 2937 6 

Qy 61 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 29377 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 29436 

Qy 121 TCGCAGGAGGGTGGGTAGT T TGCCCAGC GTAGGGGGGCTGGGCC CATAAAAGAGGAAGTG 180 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 29437 TCGCAGGAGGGTGGGTAGT T TGCCCAGC GTAGGGGGGCTGGGCC CATAAAAGAGGAAGTG 29496 

Qy 181 CAC T TAAGAC AC GGC C CC GC TGGAC GC T T GTT AGAAAC CGTCC T GGC T GGGAAGGC AAGA 240 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 29497 CAC T TAAGACAC GGC C CC GC TGGAC GC - T GTTAGAAAC C GT CC T GGC TGGGAAGGC AAGA 29555 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 29556 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 29615 

Qy 301 GCGGC CC GAGAGAGT GTGAGGC AGAGGC GGGGAGTGGC AAGGGAGTGACC AT C TCGGGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I II I I I 
Db 29616 GCGGCCC GAGAGAGT GTGAGGC AGAGGC GGGGAGTGGC AAGGGAGTGAC CATC TCGGGGA 29675 
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Qy 361 AC GAAGG AGTAAAC GC GGT GAT GGGAC GCACGGAAACGGG AGTGGAGAAAGT C AT GGAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 29676 AC GAAGGAGTAAAC GC GGT GATGGGAC GC ACGGAAACGGGAGTGGAGAAAGT CAT GGAGA 29735 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 2 9736 GAACC C T AGGC GGGGCGGT C CC C GC GGAAAGGC GGC TGC T CC AGGGTC TC C GC AC C C AAG 29795 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 97 96 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 2 9855 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 29856 T TAAGC C GCGC C GC C GCC T GC GCGGAGC C C C AC TGC GAAGC C C AGC TGCGCGC GCC T TGG 29915 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 2 9916 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 29952 



RESULT 3 

US-08-846-012A-1 

; Sequence 1, Application US/0884 6012A 

; Patent No. 5807740 

; GENERAL INFORMATION: 

APPLICANT: AMARAL, M. Catherine. 

APPLICANT: CHEN, Jin-Long 

TITLE OF INVENTION: Regulators of UCP2 Gene Expression 
NUMBER OF SEQUENCES: 16 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 2 68 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 

COUNTRY : USA 

ZIP : 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/846, 012A 

FILING DATE: 

CLASSIFICATION: 435 
ATTORNEY /AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE /DOCKET NUMBER: T97-003 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 736 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
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US-08-846-012A-1 

Query Match 76.1%; Score 485; DB 2; Length 736; 

Best Local Similarity 99.6%; Pred. No. 4.1e-100; 

Matches 507; Conservative 0; Mismatches 0; Indels 2; Gaps 2; 

Qy 129 GGG TGGG T AGT T T GC C CAGC GTAGGGGGGC TGGGC C C ATAAAAG AGGAAGT GC ACT TAAG 188 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I 1 I I I I I II I I I I I I I I 
Db 1 GGG TGGG T AGT T T GC C C AGC GTAGGGGGGCTGGGC C C ATAAAAG AGGAAGT GCAC T TAAG 60 

Qy 189 AC AC GGC CC CGC T GGACGC T TGT T AGAAACC G TC C T GGC T GGGAAGGC AAGAGGTG T GTG 248 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 

Qy 24 9 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 

Qy 309 AGAGAGT GTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACCATC TCGGGGAACGAAGGA 368 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 AGAGAGT GTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGAC CAT C T CGGGGAAC GAAGGA 239 

Qy 369 GTAAAC GCGGTGAT GGGAC GCAC GGAAACGGGAGT GGAGAAAGT CATGGAGAGAAC CC TA 428 

I I I I I I I I II I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GTAAAC GCGGTGATGGGAC GCAC GGAAAC GGG AGT GGAGAAAGT CATGGAGAGAAC C C TA 299 

Qy 429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

Qy 48 9 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I I I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

Qy 54 9 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 

Qy 609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 
Db 47 9 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 4 
US-09-100-297-1 

; Sequence 1, Application US/091002 97 

; Patent No. 5849514 

; GENERAL INFORMATION: 

APPLICANT: AMARAL, M. Catherine. 
APPLICANT: CHEN, Jin- Long 
; TITLE OF INVENTION: Regulators of UCP2 Gene Expression 
NUMBER OF SEQUENCES: 16 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE : CALIFORNIA 
COUNTRY : USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
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COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/100,297 

FILING DATE: 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/84 6,012 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE /DOCKET NUMBER: T97-003 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 7 36 base pairs 
; TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-09-100-297-1 

Query Match 76.1%; Score 485; DB 2; Length 736; 

Best Local Similarity 99.6%; Pred. No. 4.1e-100; 

Matches 507; Conservative 0; Mismatches 0; Indels 2; Gaps 2; 



Qy 12 9 GGGTGGGTAGT TTGCCCAGCGTAGGGGGGC TGGGCC CATAAAAGAGGAAGTGC AC TTAAG 188 

I I I I I I I I I I I I I 1 I I I I I I I t I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I II I I I I I 
Db 1 GGG T GGGT AGT T TGC C C AGCGT AGGGGGGC TGGGCC CATAAAAGAGGAAGTGC AC T TAAG 60 

Qy 189 ACAC GGC CC C GC TGGACGC T T GT T AGAAAC C G T C C T GGC T GGGAAGGC AAGAGGTGTGTG 248 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ACAC GGC CCCGC TGGACGC -TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 

Qy 24 9 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 

Qy 309 AGAGAGT GTGAGGCAGAGGC GGGGAGTGGC AAGGGAGTGAC CAT C TCGGGGAACGAAGGA 368 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 1 I I E I I I I I I I I 
Db 180 AGAGAGT GTGAGGCAGAGGC GGGGAGTGGCAAGGGAGT GAC CAT C TCGGGGAACGAAGGA 239 

Qy 369 GTAAACGCGGTGATGGGAC GCACGGAAACGGGAGTGGAGAAAGTCATGGAGAGAACCC TA 428 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GTAAACGC GGT GATGGGAC GCAC GGAAAC GGGAGTGGAGAAAGT C ATGGAGAGAAC CC TA 299 

Qy 429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II I I I I I I 
Db 300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

Qy 489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

Qy 54 9 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 
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Qy 609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I II I I I I I I II I I I I I I I I I 
Db 479 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 5 

US-09-949-016-2947 

Sequence 2947, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION : 
APPLICANT: VENTER, J. Craig et al. 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2947 
LENGTH: 2705 
TYPE: DNA 
ORGANISM: Human 
US-09-949-016-2947 

Query Match 14.5%; Score 92.4; DB 3; Length 2705; 

Best Local Similarity 98.9%; Pred. No. le-11; 

Matches 93; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 544 AGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGAT 603 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1681 AACCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGAT 1740 

Qy 604 TGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I ! I 
Db 1741 TGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 1774 



RESULT 6 

US-08-232-463-14/C 

; Sequence 14, Application US/08232463 

; Patent No. 5670367 

; GENERAL INFORMATION: 

APPLICANT: DORNER, F. 

APPLICANT: SCHEIFLINGER, F. 

APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Foley & Lardner 

STREET: 1800 Diagonal Road, Suite 500 
; CITY: Alexandria 

STATE: VA 
COUNTRY : USA 
ZIP: 22313-0299 
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COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/232,463 
FILING DATE: 
CLASSIFICATION : 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/07/935,313 
FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 
FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 
NAME : BENT, Stephen A. 
REGISTRATION NUMBER: 29,768 
REFERENCE/DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703) 8 36-9300 
TELEFAX: ( 703 ) 683-4109 
TELEX: 899149 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 7218 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
CLONE: pTZgpt-Fls 
US-08-232-463-14 

Query Match 9.4%; Score 60; DB 2; Length 7218; 

Best Local Similarity 4.4%; Pred. No. 0.00022; 

Matches 18; Conservative 230; Mismatches 160; Indels 0; Gaps 0; 
Qy 117 TC GC TC GC AGGAGGGTGGGT AGT T TGCC CAGC GTAGGGGGGC T GGGCC C ATAAAAGAGGA 17 6 

Db 1440 TGGTACRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1381 

Qy 177 AGT GC AC T TAAGAC AC GGC C CCGC TGGACGC T TGT T AGAAACCGTCC T GGC T GGGAAGGC 236 

Db 1380 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1321 

Qy 237 AAGAGGT GT GT GAC T GGAC AAGAC TT GT T TC T GGC GGTC AGTC T TGCC AT CC TC AC AGAG 296 

Db 1320 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1261 

Qy 297 GTTGGCGGCCCGAGAGAGTGTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACCATCTCG 356 

Db 1260 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1201 

Qy 357 GGGAACGAAGGAGTAAACGCGGTGATGGGACGCACGGAAACGGGAGTGGAGAAAGTCATG 416 

Db 1200 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1141 

Qy 417 GAGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACC 47 6 

Db 1140 RRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR 1081 

Qy 477 CAAGT AGGAGC T GGC AGGC C CGGC CCCGC C C C GC AGGC CC C ACC C C GG 524 



http://es/ScoreAccessWeb/Getftem.action?AppId-09869098&seqId=532380&ItemNam 6/7/2006 



suuKfc searcn Results Details tor Application uvsoyuys ana searcn Kesuit us-uy-soy... I'age iu ot zu 



::: ::::: :: II I III I I II I I III 

Db 1080 RRRRRRRRRRRRRRATCGCAAGCTCCCTCGACCTGCAGCCAAGCTCGG 1033 



RESULT 7 

US-09-513-999C-14461 

; Sequence 14461, Application US/09513999C 
; Patent No. 6783961 
; GENERAL INFORMATION: 

APPLICANT: Dumas Milne Edwards, J.B. 
; APPLICANT: Duclert, A. 
; APPLICANT: Giordano, J.Y. 

; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins. 

; Patent No. 6783961 

; FILE REFERENCE: 5 9. US 2. REG 

; CURRENT APPLICATION NUMBER: US/09/513 , 9 99C 

; CURRENT FILING DATE: 2000-02-24 

; PRIOR APPLICATION NUMBER: US 60/122,487 

; PRIOR FILING DATE: 1999-02-26 

; NUMBER OF SEQ ID NOS : 36681 

SOFTWARE: Pa tent. pm 
; SEQ ID NO 14461 

LENGTH: 161 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE : 
; NAME/ KEY : misc_feature 

LOCATION: 11 

OTHER INFORMATION: s=g or c 
US-09-513-999C-14461 

Query Match 8.9%; Score 57; DB 3; Length 161; 

Best Local Similarity 96.6%; Pred. No. 0.00068; 

Matches 57; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 579 AGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I : I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 9 AASCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 67 



RESULT 8 

US-09-605-703B-543/c 

Sequence 543, Application US/09605703B 
Patent No. 6962 989 
GENERAL INFORMATION: 
APPLICANT: Pompejus, Markus 
APPLICANT: Kroger, Burkhard 
APPLICANT: Schroder, Hartwig 
APPLICANT: Zelder, Oskar 
APPLICANT: Haberhauer, Gregor 

TITLE OF INVENTION: CORYNE BACTERIUM GLUTAMICUM GENES ENCODING NOVEL 
TITLE OF INVENTION: PROTEINS 
FILE REFERENCE: BGI-129CP 

CURRENT APPLICATION NUMBER: US/09/605, 703B 
CURRENT FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: 60/142,764 
PRIOR FILING DATE: 1999-07-08 
PRIOR APPLICATION NUMBER: 60/152,318 
PRIOR FILING DATE: 1999-09-03 
NUMBER OF SEQ ID NOS: 2934 
SEQ ID NO 543 
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LENGTH: 1401 
TYPE: DNA 

ORGANISM: Corynebacterium glutamicum 

FEATURE : 

NAME/ KEY : CDS 

LOCATION: (101) . . (1378) 

OTHER INFORMATION: RXN01229 

NAME/ KEY : VARIANT 

LOCATION: 119 

OTHER INFORMATION: Xaa = Tyr or Ser 
NAME/KEY: VARIANT 
LOCATION: 120 

OTHER INFORMATION: Xaa = Asn, Thr, His, or Pro 
NAME/KEY: VARIANT 
LOCATION: 121, 266 

OTHER INFORMATION: Xaa = Glu or Val 
NAME/KEY: VARIANT 
LOCATION: 122, 303 

OTHER INFORMATION: Xaa = Asn or Thr 
NAME/ KEY : VARIANT 
LOCATION: 123, 301 

OTHER INFORMATION: Xaa = Glu, Asp, Gin, or Pro 
NAME/KEY: VARIANT 
LOCATION: 124, 265 

OTHER INFORMATION: Xaa = Glu, Ala, Lys, or Asn 
NAME/KEY: VARIANT 
LOCATION: 125 

OTHER INFORMATION: Xaa = Glu, Ala, His, Pro, or Gin 

NAME/KEY: VARIANT 

LOCATION: 126, 239, 283 

OTHER INFORMATION: Xaa = Ala 

NAME/KEY: VARIANT 

LOCATION: 127, 251 

OTHER INFORMATION: Xaa = Glu, Ala, Asp, Lys, Thr, or Asn 
NAME/KEY: VARIANT 
LOCATION: 128 

OTHER INFORMATION: Xaa = lie or Thr 
NAME/KEY: VARIANT 
LOCATION: 129 

OTHER INFORMATION: Xaa = Gly or Glu 
NAME/KEY: VARIANT 
LOCATION: 130 

OTHER INFORMATION: Xaa = Lys, lie, or Asn 
NAME/KEY: VARIANT 
LOCATION: 132 

OTHER INFORMATION: Xaa = Lys, Gin, Asn, or His 
NAME/KEY: VARIANT 
LOCATION: 175 

OTHER INFORMATION: Xaa = Leu or Ser 

NAME/KEY: VARIANT 

LOCATION: 178, 230, 295 

OTHER INFORMATION: Xaa = He or Leu 

NAME/KEY: VARIANT 

LOCATION: 180, 211 

OTHER INFORMATION: Xaa = Pro 

NAME/KEY: VARIANT 

LOCATION: 181, 223, 244 

OTHER INFORMATION: Xaa = Ala or Ser 

NAME/KEY: VARIANT 

LOCATION: 187 

OTHER INFORMATION: Xaa = Val 
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NAME/KEY: VARIANT 

LOCATION: 188, 198, 219, 302, 306 
OTHER INFORMATION: Xaa = Gly or Val 
NAME/KEY: VARIANT 
LOCATION: 193 

OTHER INFORMATION: Xaa = Val or Leu 
NAME/KEY: VARIANT 
LOCATION: 204, 282 

OTHER INFORMATION: Xaa = Ala or Thr 
NAME/KEY: VARIANT 
LOCATION: 205 

OTHER INFORMATION: Xaa = Gin or Pro 
NAME/KEY: VARIANT 
LOCATION: 206, 220, 

OTHER INFORMATION: Xaa = Val or lie 
NAME/KEY: VARIANT 
LOCATION: 207 

OTHER INFORMATION: Xaa = Ser or Pro 
NAME/KEY: VARIANT 
LOCATION: 210, 298 

OTHER INFORMATION: Xaa = Phe or Ser 
NAME/KEY: VARIANT 

LOCATION: 214, 238, 243, 252, 257, 269 
OTHER INFORMATION: Xaa = Gly or Ala 
NAME/KEY: VARIANT 

LOCATION: 216, 217, 225, 249, 258, 319 
OTHER INFORMATION: Xaa = Gly or Arg 
NAME/KEY: VARIANT 
LOCATION: 218, 236 

OTHER INFORMATION: Xaa = Leu or Phe 

NAME/KEY: VARIANT 

LOCATION: 221, 28 6 

OTHER INFORMATION: Xaa = Gly 

NAME/ KEY : VARIANT 

LOCATION: 226, 29 9 

OTHER INFORMATION: Xaa = Leu or Pro 
NAME/KEY: VARIANT 
LOCATION: 234 

OTHER INFORMATION: Xaa = Lys or Thr 
NAME/KEY: VARIANT 
LOCATION: 235 

OTHER INFORMATION: Xaa = Gin or Leu 
NAME/KEY: VARIANT 
LOCATION: 237, 291 

OTHER INFORMATION: Xaa = Gly, Val, Trp, Leu, Cys, or Phe 
NAME/KEY: VARIANT 
LOCATION: 240, 256 

OTHER INFORMATION: Xaa = Val or Ala 
NAME/KEY: VARIANT 
LOCATION: 241, 254 

OTHER INFORMATION: Xaa = Val, Leu, or Phe 
NAME/KEY: VARIANT 
LOCATION: 242 

OTHER INFORMATION: Xaa = Gly, Cys, or STOP codon 
NAME/KEY: VARIANT 
LOCATION: 245 

OTHER INFORMATION: Xaa = Gly, Ala, Trp, Ser, or STOP codon 
NAME/KEY: VARIANT 
LOCATION: 247 

OTHER INFORMATION: Xaa = Lys or Gin 
NAME/KEY: VARIANT 
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LOCATION: 248 

OTHER INFORMATION: Xaa = Arg or Lys 
NAME/KEY: VARIANT 
LOCATION: 250 

OTHER INFORMATION: Xaa = Gly, Glu, Arg, Lys, Asp, Asn, 
OTHER INFORMATION: or Ser 
NAME/KEY: VARIANT 
LOCATION: 253 

OTHER INFORMATION: Xaa = Gly, Val, Cys, or Phe 
NAME/KEY: VARIANT 
LOCATION: 259 

OTHER INFORMATION: Xaa = Cys, Phe, Arg, or Leu 
NAME/KEY: VARIANT 
LOCATION: 260, 310 

OTHER INFORMATION: Xaa = Cys or Phe 

NAME/KEY: VARIANT 

LOCATION: 261, 262 

OTHER INFORMATION: Xaa = Arg 

NAME/KEY: VARIANT 

LOCATION: 263, 296 

OTHER INFORMATION: Xaa = Arg or Leu 
NAME/ KEY: VARIANT 
LOCATION: 264, 289 

OTHER INFORMATION: Xaa = Glu, Val, Asp, Gin, Leu, 
OTHER INFORMATION: or His 
NAME /KEY: VARIANT 
LOCATION: 267 

OTHER INFORMATION: Xaa = Val, Met, or He 
NAME/KEY: VARIANT 
LOCATION: 268 

OTHER INFORMATION: Xaa = Gly, Val, Ser, or He 
NAME/KEY: VARIANT 
LOCATION: 270 

OTHER INFORMATION: Xaa « Asp or Ala 
NAME /KEY: VARIANT 
LOCATION: 271 

OTHER INFORMATION: Xaa = Asp, Val, His, or Leu 
NAME/KEY: VARIANT 
LOCATION: 272 

OTHER INFORMATION: Xaa = Leu 
NAME/KEY: VARIANT 
LOCATION: 273, 307 

OTHER INFORMATION: Xaa = Val, Ala, Leu, or Pro 
NAME /KEY: VARIANT 
LOCATION: 274 

OTHER INFORMATION: Xaa = Asp, Ala, or Glu 
NAME/KEY: VARIANT 
LOCATION: 275 

OTHER INFORMATION: Xaa = Thr or His 
NAME/KEY: VARIANT 
LOCATION: 276 

OTHER INFORMATION: Xaa = Gin, His, or Pro 
NAME/KEY: VARIANT 
LOCATION: 277, 278, 285 

OTHER INFORMATION: Xaa = Gly, Pro, Arg, or Ala 
NAME/KEY: VARIANT 
LOCATION: 279, 284 

OTHER INFORMATION: Xaa = Ala or Pro 
NAME/KEY: VARIANT 

Query Match 8.0%; Score 51; DB 4; Length 1401; 
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Best Local Similarity 14.0%; Pred. No. 0.02; 

Matches 36; Conservative 12 3; Mismatches 98; Indels 0; Gaps 0; 

Qy 318 GAGGC AGAGGC GGGGAGTGGCAAGGGAGT GAC CATC TC GGGGAAC GAAGGAG TAAAC GCG 377 

I I I I : :: | :::::: : : : I : I : : : : :::::::: : : :: 

Db 1061 GTGGCCYWMYYSGKVTOIOrSKYYTYYYMRRAA 1002 

Qy 378 GTG ATGGGAC GC ACGGAAAC GGGAGTGGAGAAAGT C ATGGAGAGAACC C TAGGCGGGGC G 437 

: : :|:: : |:| :::::: : | ::|:::: ::: :: :| :: 
Db 1001 SCSGRRGRRAAAMRMGWAKWSWSMSSSRSMMYAWMWSKSGMSMSCCKSSKGSRGCRGYSS 942 

Qy 438 GTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCC 497 

:|: :::: : : :::|: : : I I : : ||: :: I :: : :::::: | 
Db 941 WYCMWGSMSSCSSMKGRTSKKCWRSRAGGWSGKCGSCGMYSAYYWCKTYSWS SMGKCGWC 882 

Qy 498 GGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGC 557 

I : :::|:::|:: | |:|: :::::: :||:: |: :::|:::| |:: 

Db 881 GAMAAMRMCSMSCMRCCACMAMAMMMSCMKYMYYCCYYYTYTKTYYYSMSGMRSCWCMKA 822 

Qy 558 CTGCGCGGAGCCCCACT 574 

:::|: : ::::| I 
Db 821 MWRCKGCSSCSMMSAAT 805 



RESULT 9 

US-09-949-016-11959 

Sequence 11959, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al. 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 11959 
LENGTH: 69834 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: (1) . . . (69834) 
OTHER INFORMATION: n = A, T,C or G 
US-09-949-016-11959 

Query Match 7.6%; Score 48.4; DB 3; Length 69834; 

Best Local Similarity 52.5%; Pred. No. 0.12; 

Matches 10 6; Conservative 0; Mismatches 96; Indels 0; Gaps 0; 

Qy 436 CGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGC 495 

I I I I I I I I III II I III I I I I I I I I I I I I 

Db 38656 CGCTCCCCTGCCACCCCCGCCCCTGCGACCCTGCCCGAGCCCAGCCTAGCGCAGCCCGGC 38715 
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Qy 4 96 CCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCC 555 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I III 
Db 38716 CAGGCCCGGCCGCGGGGGGCGCCGTCCCGCCCGCCCGCCCGAGCCCGTACCCGCCGCGGC 38775 

Qy 556 GCC T GC GCGGAGCCC C AC T GCGAAGC C C AGC T GCGC GCGC C T TGGGAT TGAC TGTC C ACG 615 

I II I I I I II I I III II I I I I I I I I I I II 

Db 38776 GCCGGTGCGTCACGCTGCGTCAGGGCCGCGCCCCGCGCGACGTCGCTGCCTCGCGGCCCC 38835 

Qy 616 CTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I III 
Db 38836 GCCCCCAGGCCCGACATCCTCG 38857 



RESULT 10 

US-09-949-016-12925 

Sequence 12925, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al. 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/94 9,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 12925 
LENGTH: 69834 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME /KEY : misc_f eatur e 
LOCATION: (1) . . . (69834) 
OTHER INFORMATION: n = A, T,C or G 
US-09-949-016-12925 

Query Match 7.6%; Score 48.4; DB 3; Length 69834; 

Best Local Similarity 52.5%; Pred. No. 0.12; 

Matches 106; Conservative 0; Mismatches 96; Indels 0; Gaps 0; 

Qy 436 CGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGC 495 

I I I I I I I I III II I III I I I I I I I I I I I I 

Db 38656 CGCTCCCCTGCCACCCCCGCCCCTGCGACCCTGCCCGAGCCCAGCCTAGCGCAGCCCGGC 38715 

Qy 496 CCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCC 555 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I III 
Db 38716 CAGGCCCGGCCGCGGGGGGCGCCGTCCCGCCCGCCCGCCCGAGCCCGTACCCGCCGCGGC 38775 

Qy 556 GC C TGC GCGGAGC C C C AC TGCGAAGC C C AGC T GC GC GC GC C T T GGGAT TGAC TGTC C AC G 615 

I I I I I I I II I I III II I I I I I I I I I I M 

Db 38776 GCCGGTGCGTCACGCTGCGTCAGGGCCGCGCCCCGCGCGACGTCGCTGCCTCGCGGCCCC 38835 

Qy 616 CTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I III 
Db 38836 GCCCCCAGGCCCGACATCCTCG 38857 
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RESULT 11 

US-09-949-016-17110/C 

Sequence 17110, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al. 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 17110 
LENGTH: 70014 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME /KEY : misc_f eature 
LOCATION: (1) ... (70014) 
OTHER INFORMATION: n = A, T,C or G 
US-09-949-016-17110 

Query Match 7.1%; Score 45.4; DB 3; Length 70014; 

Best Local Similarity 47.6%; Pred. No. 0.57; 

Matches 165; Conservative 0; Mismatches 181; Indels 1; Gaps 1; 

Qy 248 GAC TGGACAAG AC T T GT T TC TGGC GGTC AGT C T T GC CATC C TC AC AGAGGT TGGC GGCCC 307 

I I I I I I I I I I I I I I I III III II I I I I 

Db 2224 GC C TGC AGCGAGC GC GGC TC C GGC GGAGGTC C CGGGC AGGC TCC GGGC GCCAGGC GC GAG 2165 

Qy 308 GAGAGAGTGTGAGGC AGAGGC GGGGAGT GGCAAGGGAGTGAC CAT C TC GGGGAAC GAAGG 367 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2164 C AGAGAAGGC GGAGACGC GGCC TC CGGT TGC C AGGT GCGC GTC T C C AC GC AGAGGC C C AG 2105 

Qy 368 AGTAAAC GCGGTGATGGGACGCAC GGAAACGGGAGT GGAGAAAGTC ATGGAGAGAACCCT 427 

I I I I II I I I I I I I I I I I I I II II 

Db 2104 GT AGGAGGAGGAGAAGGGGC GGGGAGAAAAGGAGGAC GC T TGAC C CGC T T TAGCGC GGC T 2045 

Qy 428 AGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGC 487 

II I I I I I I I I I I I III III II I I I I I I I 

Db 2044 CTGGGCTCCCGCCGCCGCGCAGGCCCCTCCCGCCCCGCCCCCCGGAGCCGGACTCGGAGC 1985 

Qy 488 TGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGG-CTTAAGC 546 

III I II I I I I III III I I I I I I I I I I I I II I I I II 

Db 1984 CGGCTTGTCCCGGCCCCGCACACTTCTCCCTGCCCGGGCCTCCCGCACCAGGTCTCGCGC 1925 

Qy 547 CGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGC 593 

III II I I I I I I I I I I I I I II I I I I I I 

Db 1924 CGCAGGCCCCGCCCCTCTGCGCTGCACTGCGGTTCCCACAGCCCCGC 1878 



RESULT 12 
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US-09-949-016-13822 

; Sequence 13822, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al. 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 13822 

LENGTH: 27270 

TYPE: DNA 

ORGANISM: Human 
US-09-949-016-13822 

Query Match - 7.0%; Score 44.8; DB 3; Length 27270; 

Best Local Similarity 45.1%; Pred. No. 0.69; 

Matches 210; Conservative 0; Mismatches 2 52; Indels 4; Gaps 1; 



Qy 40 ACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGC 99 

II I I I I I I I I I I I I I Ell I I I I I I II II II 

Db 1130 ACATGGGGGGTGTGAGGTGGGGACAGGCAGGGAGATGGGGGGTGCGAGGTGGGGACAAGC 1189 

Qy 100 AGGAGCCAGCC GCGC GC TCGC TCGCAGGAGGGTGGGTAGT TTGC CCAGCGTAGGGGGGC T 159 

III I I I I I I III III III II II II I I I I I I 

Db 1190 AGGGACATGGGGAGTGCGAGGTGGGGACAGGCAGGGACGTGGGGCGTGCGAGGTGGGGAC 1249 

Qy 160 GGGC C CATAAAAGAGGAAGTGCAC T TAAGACAC GGC C C CGC TGG ACGC T TGT T AGAAAC C 219 

III II III III I I I I I I I I I I I I I I 

Db 1250 AGGC AGGGACATAAGGGTG TGAGG TGGAGACAGGCAGGGAC ATGGGGAGT C TGAGGT GGG 1309 

Qy 220 GTCCTGGCTGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTC 279 

III I I I I I I I I I I I I I I I I I I I 

Db 1310 GACAGGCAGGGAGATGGGGGGTGTGAGTTGGGGACAGGCAGGGAGATGGGGGGGTGCGAG 1369 

Qy 280 T TGC CAT CC TC AC AGAGGT T GGCGGCC C GAGAGAGT GTGAGGC AGAGGC GGGGAGT GGC A 339 

II I I I I I I I I I I II I I I I I I I I II I I 

Db 1370 GTGGGGATAAGCAGGGACATGGGGGTGCGAGGTGGGGACAGGCAGGGACATGGGGGTGGG 1429 

Qy 340 AGGGAGTGACCATCTCGGGGAACGAAGGAGTAAACGCGG TGATGGGAC GC AC GGAA 395 

I II I I I I I I I I I I I I I II I I I I I II 

Db 1430 AGG TGGGGAC AGGC AGGGAC ATAGGGGG TGGGAGGT GGGC AC AT TC AGGGAGATGGGGGT 1489 

Qy 396 AC GGGAGTGGAGAAAGTC AT GGAGAGAAC CC T AGGC GGGGCGGT CC CC GCGGAAAGGC GG 455 

II I I I I I I I I I I I I I I II II II I III II II 

Db 1490 GTGAGGTGGGAGACAGGCAGGGACATAGGGGGAGCGAGGTGGGGACAGGCAGGGAGATGG 1549 

Qy 456 CTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCC 501 

I I II I I I II I I I I III III III 

Db 1550 AGGGT GC AAGGTGAAGAC AGGCAGGGAC ATGG TGTGAGGT GGGGAC 1595 
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RESULT 13 
US-09-827-688-8 

; Sequence 8, Application US/09827688 

; Patent No. 6821955 

; GENERAL INFORMATION: 

; APPLICANT: ORSON, FRANK 

; APPLICANT: KINSEY, BERMA 

; APPLICANT: BHOGAL , BALBIR 

; TITLE OF INVENTION: MACROAGGREGATED PROTEIN CONJUGATES AS ORAL GENETIC IMMUNIZATION 

; TITLE OF INVENTION: AGENTS 

; FILE REFERENCE: P01949US1/10004014 

; CURRENT APPLICATION NUMBER: US/09/827,688 

; CURRENT FILING DATE: 2001-04-06 

; PRIOR APPLICATION NUMBER: 60/195,680 

; PRIOR FILING DATE: 2000-04-07 

; NUMBER OF SEQ ID NOS : 13 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 8 

LENGTH: 15474 6 

TYPE: DNA 

ORGANISM: HERPESVIRUS 2 
US-09-827-688-8 

Query Match 7.0%; Score 44.6; DB 3; Length 154746; 

Best Local Similarity 50.2%; Pred. No. 0.94; 

Matches 110; Conservative 0; Mismatches 109; Indels 0; Gaps 0; 

Qy 418 AGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCC 477 

I I I I II I I I I I I I I I II I I I I II I I I I I I 

Db 127637 AGAGCCCCAGACCCCCCGCGGGCGCGGGAGACGTGCCGCCCGCCCGACCCCCGCCCGCCC 127696 

Qy 478 AAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGA 537 

E I I I I I I I I I I I I I I 1 III I I I I I I I I I I I I I 

Db 127 697 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCC 1277 56 

Qy 538 GGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCT 597 

II III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 127757 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCC 127816 

Qy 598 TGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGC 636 

I I I I I II I I I I I I M III 
Db 127817 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGC 127855 



RESULT 14 
US-09-827-688-8/C 

; Sequence 8, Application US/09827688 

; Patent No. 6821955 

; GENERAL INFORMATION: 

; APPLICANT: ORSON, FRANK 

; APPLICANT: KINSEY, BERMA 

; APPLICANT: BHOGAL, BALBIR 

; TITLE OF INVENTION: MACROAGGREGATED PROTEIN CONJUGATES AS ORAL GENETIC IMMUNIZATION 

; TITLE OF INVENTION: AGENTS 

; FILE REFERENCE: P0194 9US1/10004014 

; CURRENT APPLICATION NUMBER: US/09/827,688 

; CURRENT FILING DATE: 2001-04-06 

; PRIOR APPLICATION NUMBER: 60/195,680 

; PRIOR FILING DATE: 2000-04-07 

; NUMBER OF SEQ ID NOS: 13 

; SOFTWARE: Patentln version 3.0 
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SEQ ID NO 8 

LENGTH: 154746 
TYPE: DNA 

ORGANISM: HERPESVIRUS 2 
US-09-827-688-8 

Query Match 7.0%; Score 44.6; DB 3; Length 154746; 

Best Local Similarity 50.2%; Pred. No. 0.94; 

Matches 110; Conservative 0; Mismatches 109; Indels 0; Gaps 0; 

Qy 418 AGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCC 477 

I I I I II I I I I I I I I I II I I I I II I I I I I I 

Db 154105 AGAGCCCCAGACCCCCCGCGGGCGCGGGAGACGTGCCGCCCGCCCGACCCCCGCCCGCCC 154046 

Qy 478 AAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGA 537 

I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I 

Db 154045 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCC 153986 

Qy 538 GGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCT 597 

II III I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 153985 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGCCCGCCC 153926 

Qy 598 TGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGC 636 

I I I II II I I I I I I I I III 
Db 153925 GACCCCCGCCCGCCCGACCCCCGCCCGCCCGACCCCCGC 153887 



RESULT 15 

US-09-94 9-016-12505/C 

Sequence 12505, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION : 
APPLICANT: VENTER, J. Craig et al. 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES THEREOF 
FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 12505 
LENGTH: 670689 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME /KEY : misc_f eatur e 
LOCATION: (1) . . . (670689) 
OTHER INFORMATION: n = A, T,C or G 
US-09-949-016-12505 

Query Match 7.0%; Score 44.4; DB 3; Length 670689; 

Best Local Similarity 54.2%; Pred. No. 1.2; 

Matches 90; Conservative 0; Mismatches 76; Indels 0; Gaps 0; 
Qy 431 CGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGG 490 
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1 1 I 1 1 1 1 1 1 II I I I I 1 1 1 III III I I I 

Db 40337 CGCGCCGCCCCCCCCGGCCCCCGCGACGCCCCACCCACCCCCGCCCCCCGCCCGCACCCC 40278 

Qy 491 CAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCG 550 

I I I I I I I I I I II I I I I I III I I I I I I I III I I I I I I I 

Db 40277 CCGGCCAGCACCCCCGACCCGCGCCCCCGCCCCCCGCCCCCCCCGCACGCCCACGCCGCG 40218 

Qy 551 CCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCC 596 

I I I I I I I II I I I I I I III I I I II I I I 

Db 40217 CCCCCGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCGCCCGGGCC 40172 



Search completed: June 6, 2006, 20:53:54 
Job time : 17 9 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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SCORE Search Results Details for Application 
09869098 and Search Result us-09-869-098a- 
l__copy_1634_2270.rng. 

Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09869098 and Search Result us-09- 

869-098a-l_copy_1634_2270.rng. 

start 

Go Back to previous page 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title : 

Perfect score: 
Sequence : 

Scoring table : 



June 6, 2006, 20:07:44 ; Search time 548 Seconds 

(without alignments) 
8104.603 Million cell updates/sec 

US-09-869-098A-l_COPY_1634_2270 
637 

1 acctgttgctgcagttctga cgcccggctcgtccgacgcg 637 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 5244920 seqs, 3486124231 residues 

Total number of hits satisfying chosen parameters: 



10489840 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : N_Geneseq_8 : * 

1: geneseqnl980s : * 
2: geneseqnl990s : * 
3 : geneseqn2000s : * 
4: geneseqn2001as : * 
5: geneseqn2001bs : * 
6: geneseqn2002as : * 
7: geneseqn2002bs : * 
8: geneseqn2003as : * 
9 : geneseqn2003bs : * 
10 : geneseqn2003cs : * 
11: geneseqn2003ds: * 
12 : geneseqn2004as : * 
13: geneseqn2004bs: * 
14 : geneseqn2005s:* 
15 : geneseqn2006s : * 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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.2 


2452 


13 


ACN42716 


Acn42 716 Human dia 


c 


31 


45 . 8 


7 


.2 


2506 


13 


ACN42715 


Acn42715 Human dia 




32 


45 . 8 


7 


.2 


2512 


13 


ACN42714 


Acn42 714 Human dia 


Q 


33 


45 


7 


.1 


4565 


10 


ABX95723 


Abx95723 DNA encod 


Q 


34 


45 


7 


.1 


4565 


12 


ADI37007 


Ad i 37 007 Novell hum 

>J 1 v V 1 V CX 11U1LL 


C 


35 


45 


7 


.1 


4565 


12 


ADL82988 


Adl82988 Human PRO 


C 


36 


45 


7 


.1 


4565 


12 


ADL91794 


Adl917 94 Human PRO 


C 


37 


45 


7 


.1 


4565 


13 


ADP56159 


Adp56159 Human PRO 


C 


38 


45 


7 


.1 


4565 


14 


ADY14975 


Adyl4975 DNA encod 


c 


39 


45 


7 


.1 


4565 


14 


ADY20007 


Ady20007 DNA encod 


c 


40 


45 


7 


.1 


4566 


3 


AAC74710 


Aac74710 Human ORF 


c 


41 


45 


7 


.1 


4607 


5 


ABV28262 


Abv28262 Human pro 


c 


42 


45 


7 


.1 


5171 


12 


ADQ23208 


Adq23208 Human sof 


c 


43 


44.8 


7 


.0 


2413 


13 


ADT20125 


Adt20125 Plant cDN 




44 


44. 6 


7 


.0 


1281 


2 


AAQ23296 


Aaq23296 HSV-1 (MG 


c 


45 


44.6 


7 


.0 


114955 


2 


AAX53491 


Aax534 91 Human ade 



ALIGNMENTS 

RESULT 1 
AAA62 932 
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o^wivlj, ocoiui ivcauiLd l^cuma iua rvppiiwauuii U7ou^v/7o anu ocaivn xvcaun uru7-ou7-v... ^ago J 



ID AAA62932 standard; DNA; 3505 BP. 
XX 

AC AAA62932; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE DNA containing human uncoupling protein-2 (UCP-2) promoter region. 
XX 

KW Promoter; human; uncoupling protein-2; UCP-2; obesity; diabetes; 

KW hypotension; hyperlipidaemia; anti-pyretic; ds . 

XX 

OS Homo sapiens. 
XX 

PN WO200039315-A1. 
XX 

PD 06-JUL-2000. 
XX 

PF 22-DEC-1999; 99WO-JP007198 . 
XX 

PR 24-DEC-1998; 98 JP-00366719 . 
XX 

PA (TAKE ) TAKE DA CHEM IND LTD. 
XX 

PI Toyoda Y, Kobayashi M, Igaki S; 
XX 

DR WPI; 2000-452407/39. 
XX 

PT DNA with promoter region containing regulator sequence of uncoupling 

PT protein-2 (UCP-2), applicable in screening anti-obesity, anti-diabetic, 

PT hypotensive, anti-hyperlipidemic and anti-pyretic drugs for use in 

PT therapy. 
XX 

PS Claim 4; Fig 1-6; 43pp; Japanese. 
XX 

CC This invention relates to DNA comprising a promoter region containing the 

CC regulatory sequences of human uncoupling protein-2 (UCP-2) . Included in 

CC the invention are a recombinant vector containing the DNA sequence, cells 

CC transformed by the vector, and a method for screening for compounds or 

CC salts that can promote or inhibit the UCP-2 promoter activity using the 

CC transformants . The DNA and cells transformed using it can be used to 

CC screen for anti-obesity, anti-diabetic, hypotensive, anti-hyper lipidaemic 

CC and anti-pyretic drugs. The present sequence represents DNA containing 

CC the UCP-2 promoter sequences 
XX 

SQ Sequence 3505 BP; 671 A; 1053 C; 894 G; 887 T; 0 U; 0 Other; 



Query Match 100.0%; Score 637; DB 3; Length 3505; 

Best Local Similarity 100.0%; Pred. No. 3.4e-133; 

Matches 637; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 




Db 


1634 


ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 


1693 


Qy 


61 


GAC AGC GAGGC C GGT C GC GAGGCC C CAG TC CC GCC C T GCAGGAGC C AGCC GC GC GC T C GC 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1694 


GAC AGC GAGGCCGGTCGC GAGGCC C CAG TCCCGCCCTGCAGGAGCC AGC CGCGCGCTCGC 


1753 


Qy 


121 


TC GCAGGAGGG TGGGTAGT T TGCC C AGC GT AGGGGGGC T GGGCC C ATAAAAGAGGAAGTG 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1754 


TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTG 


1813 
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Qy 181 C AC T T AAGAC ACGGC CCCGCT GGAC GC T T GT T AGAAAC CG T CC T GGC TGGGAAGGC AAGA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1814 C AC T T AAGAC ACGGC CCCGCTGGAC GC T T GT T AGAAAC C G TCC T GGC TGGGAAGGC AAGA 1873 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1874 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 1933 

Qy 301 GC GGCC C GAGAGAGT GTGAGGC AGAGGC GGGGAGT GGC AAGGGAGTGAC CAT C TC GGGGA 360 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1934 GCGGCCCGAGAGAGTGTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACCAT C TCGGGGA 1993 

Qy 361 AC GAAGGAGT AAACGCGGT GATGGGAC GC ACGGAAAC GGGAGT GGAGAAAGT C AT GGAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I II I I I I I I I 

Db 1994 AC GAAGGAGTAAACGCGGT GATGGGAC GC AC GGAAAC GGGAGT GGAGAAAGT CAT GGAGA 2053 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 2054 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 2113 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2114 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 2173 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I II I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2174 T TAAGC C GC GCC GCC GCC T GC GCGGAGC C CC AC TGC GAAGC C C AGC T GC GCGC GCC T T GG 2233 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 > 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2234 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 2270 



RESULT 2 
ADG65405 

ID ADG65405 standard; DNA; 9314 BP. 
XX 

AC ADG65405; 
XX 

DT ll-MAR-2004 (first entry) 
XX 

DE Human uncoupling protein 2 (UCP2) gene. 
XX 

KW anorectic; antidiabetic; immunomodulator ; gene therapy; haplotyping; 

KW uncoupling protein 2; mitochondrial; proton carrier; UCP2; 

KW polymorphic site; haplotype; haplotype pair; obesity; diabetes; 

KW immunological disorder; body mass defect; thermoregulation defect; human; 

KW gene; ds; SNP; single nucleotide polymorphism. 

XX 

OS Homo sapiens.. 



XX 

FH Key Location/Qualifiers 

FT variation 1283 

FT /*tag= a 

FT /standard_name= "Single nucleotide polymorphism" 

FT variation 1714 

FT /*tag= b 

FT /standard_name= "Single nucleotide polymorphism" 

FT variation 2051 

FT /*tag= c 
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o^wivu, jcoiui ivcauitd j^cuuia lkji rvppiioauun u?ou7U7o aiiu ocaiiai xvcdun us-u7-ou7-u. 



age j u± 



FT /standard_name= "Single nucleotide 

FT variation 2124 

FT /*tag= d 

FT /standard_name= "Single nucleotide 

FT variation 2287 

FT /*tag= e 

FT /standard_name= "Single nucleotide 

FT variation 2408 

FT /*tag= f 

FT /standard_name= "Single nucleotide 

FT variation 4768 

FT /*tag= g 

FT /standard_name= "Single nucleotide 

FT variation 4785 

FT /*tag= h 

FT /standard_name= "Single nucleotide 

FT variation 4813 

FT /*tag= i 

FT /standard_name= "Single nucleotide 

FT variation 4882 

FT /*tag= j 

FT /standard_name= "Single nucleotide 

FT variation 4 97 6 

FT /*tag= k 

FT /standard_name= "Single nucleotide 

FT CDS 5281. .8649 

FT /*tag= m 

FT /product= "Uncoupling protein 2" 

FT exon 5281. .5406 

FT /*tag= 1 

FT /number= 1 

FT intron 5407. .5562 

FT /*tag= n 

FT /number= 1 

FT exon 5563. .5773 

FT /*tag= o 

FT / number = 2 

FT variation 5600 
FT /*tag= p 

FT / standard_name= 

FT intron 5774. .6641 

FT /*tag^= q 

FT /number= 2 

FT variation 5820 
FT /*tag= r 

FT /standard_name= "Single nucleotide 

FT variation 6536 

FT /*tag= s 

FT /standard_name= "Single nucleotide 

FT variation 6607 

FT . - /*tag= t 

FT /standard_name= "Single nucleotide 

FT variation 6617 

FT /*tag= u 

FT /standard_name= "Single nucleotide 

FT exon 6642. .6836 

FT /*tag= v 

FT /number= 3 

FT intron 6837. .6916 

FT /*tag= w 

FT /number= 3 



polymorphism" 
po 1 ymo r phi sm " 
polymorphi sm" 
polymorphism" 
polymorphism" 
po 1 ymo r phi sm " 
polymorphism" 
polymorphism" 
po 1 ymo r phi sm " 



'Single nucleotide polymorphism" 



polymorphism" 
polymorphism" 
polymorphism" 
polymorphism" 
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FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
PA 
PA 
PA 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
PT 
XX 



variation 6872 

/*tag= x 

/standard_name= "Single nucleotide polymorphism" 
exon 6917. .7018 

/*tag= y 

/number= 4 
variation 6966 

/*tag= z 

/standard_name= "Single nucleotide polymorphism" 
intron 7019. .7984 

/*tag= aa 

/number= 4 
variation 7036 

/*tag= ab 

/standard_name= "Single nucleotide polymorphism" 
variation 7086 

/*tag= ac 

/standard_name= "Single nucleotide polymorphism" 
exon 7985. .8165 

/*tag= ad 

/number= 5 
variation 8100 

/*tag= ae 

/standard_name= "Single nucleotide polymorphism" 
intron 8166. .8534 

/*tag= af 

/number = 5 
variation 8221 

/*tag= ag 

/standard_name= "Single nucleotide polymorphism" 
exon 8535. .8649 

/*tag= ah 

/number= 6 
variation 8677 

/*tag= ai 

/standard_name= "Single nucleotide polymorphism" 

US2003207284-A1. 
06-NOV-2003. 

16-JUL-2002; 2002US-00197019 . 

25-JAN-2001; 2001WO-US002485 . 

(CHEW/) CHEW A. 

(DENT/) DENTON R R. 

(GILS/) GILSON C R. 

(NAND/) NAN DAB ALAN K. 

(PARK/) PARKS K E. 

Chew A, Denton RR, Gilson CR, Nandabalan K, Parks KE; 

WPI; 2004-051505/05. 
P-PSDB; ADG65407. 

Haplotyping Uncoupling Protein 2 gene of an individual comprises 
identifying the phased sequence of nucleotides at polymorphic sites of 
the gene and assigning a haplotype or haplotype pair consistent with the 
phased sequence. 
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o^wivl; occu^u ivcsuils L/CLdio hji r^ppii^auuii U70U7U70 aiiu ocaii/iiivcauii U5-V7-OU7-U... ' 



PS Claim 1; SEQ ID NO 1; 64pp; English. 
XX 

CC The invention describes haplotyping the uncoupling protein 2 

CC (mitochondrial, proton carrier) (UCP2) gene of an individual comprising 

CC identifying the phased sequence of nucleotides at polymorphic sites (PS)1 

CC -23 for at least one copy of the individual's UCP2 gene and assigning to 

CC the individual a UCP2 haplotype or haplotype pair that is consistent with 

CC the phased sequence. The composition and methods are useful in 

CC haplotyping and/or genotyping the UCP2 gene in an individual to e.g. 

CC screen for drugs targeting the UCP2 protein to treat a condition or 

CC disease predicted to be associated with UCP2 activity. The disease or 

CC condition may include obesity, diabetes, immunological disorders and 

CC other diseases associated with defects in body mass and thermoregulation. 

CC This sequence represents the human uncoupling protein 2 (UCP2) gene. 

XX 

SQ Sequence 9314 BP; 1904 A; 2619 C; 2371 G; 2397 T; 0 U; 23 Other; 

Query Match 94.2%; Score 599.8; DB 12; Length 9314; 

Best Local Similarity 98.4%; Pred. No. 8.2e-125; 

Matches 627; Conservative 0; Mismatches 7; Indels 3; Gaps 2; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 248 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 307 

Qy 61 GAC AGC GAGGC C GGT CGCGAGGC C C CAG TCCCGCCCT GCAGGAGC C AGC C GC GCGC TC GC 120 

I I I I I II I I I I I I I I I I I I 1 II I I I I I I I I 1 I 1 I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 308 GAC AGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCC TGCAGGAGCCAGCCGCGCGC TCGC 367 

Qy 121 TC GC AGGAGGGT GGGTAGT T TGC C C AGC GT AGGGGGGC TGGGCC C ATAAAAG AGGAAGTG 180 

I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 368 TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTA-GGGGGCTGGGCCCATAAAAGAGGAAGTG 426 

Qy 181 C AC T TAAGAC AC GGC CCC GC TGGACGC T TGT T AGAAAC CG TC C T GGC T GGGAAGGC AAGA 240 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 427 CACTTAAGACACGGCCCAG — TGGAC GC TGT T AGAAAC C GTC CTGGC TGGGAAGGCAAGA 484 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I 
Db 485 GGT GT GT GAC T GGAC AAGAC TTGT T TC T GGCGGTC AGTC T TGCC AT C C TC AC AGAGGT T G 544 

Qy 301 GCGGCCCGAGAGAGTGTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGACCAT CTCGGGGA 360 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I II I I I I I I II 
Db 545 GCGGCCC GAGAGAGT GTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGACCAT C TC GGGGA 604 

Qy 361 ACGAAGGAGTAAACGC GGT GATGGGACGC ACGGAAAC GGGAGTGGAGAAAGT CAT GGAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 605 ACGAAGGAGTAAACGCGGT GATGGGACGC ACGGAAAC GGGAGTGGAGAAAGT CATGGAGA 664 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I 
Db 665 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 724 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 725 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 784 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 785 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 844 
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Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 845 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 881 



RESULT 3 
AAV44974 

ID AAV44974 standard; cDNA; 736 BP. 
XX 

AC AAV44974; 
XX 

DT 16-NOV-1998 (first entry) 
XX 

DE UCP2 gene transcriptional promoter sequence. 
XX 

KW Uncoupling protein 2; UCP2 gene; transcriptional promoter; mouse; 

KW mitochondrial protein; cis transcriptional regulatory activity; therapy; 

KW expression modulator screening; fat feeding; diabetes; obesity; ds . 

XX 

OS Mus sp. 
XX 

PN US5807740-A. 
XX 

PD 15-SEP-1998. 
XX 

PF 25-APR-1997; 97US-00846012 . 
XX 

PR 25-APR-1997; 97US-00846012 . 
XX 

PA (TULA-) TULARIK INC. 
XX 

PI Chen J, Amaral MC; 
XX 

DR WPI; 1998-520130/44. 
XX 

PT Mouse UCP2 gene promoter - useful for forming trans fected cell lines 

PT employed in drug screening assays . 

XX 

PS Claim 1; Col 7-8; 9pp; English. 
XX 

CC This sequence represents the mouse mitochondrial uncoupling protein 2 

CC (UCP2) gene transcriptional promoter of the invention. The promoter has 

CC cis transcriptional regulatory activity. Cells containing the promoter 

CC attached to a non-UCP2 gene, in which the non-UCP2 gene is a reporter 

CC gene can be used in screening assays for modulators of UCP2 gene 

CC expression, which may be useful for treating disorders in which the UCP2 

CC gene is upregulated in response to fat feeding, e.g. diabetes and obesity 
XX 

SQ Sequence 736 BP; 119 A; 236 C; 235 G; 146 T; 0 U; 0 Other; 

Query Match . 76.1%; Score 485; DB 2; Length 736; 
Best Local Similarity 99.6%; Pred. No. 3.4e-99; 

Matches 507; Conservative 0; Mismatches 0; Indels 2; Gaps 2; 

Qy 129 GGG TGGGT AGT T TGC CC AGCGTAGGGGGGCT GGGCC C ATAAAAGAGGAAGT GC AC T TAAG 188 

I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1 GGGTGGGTAGT T TGC CC AGCGTAGGGGGGC TGGGC C CATAAAAGAGGAAGT GCAC T TAAG 60 

Qy 189 ACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 248 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II 

Db 61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 
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Qy 24 9 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I I I I I I I I I I I I I I I I I I II I I i I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I 
Db 120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 

Qy 309 AGAGAGT GTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGACCAT C T C GGGGAAC GAAGGA 368 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 AGAGAGT GTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGACCAT C T C GGGGAAC GAAGGA 239 

Qy 369 GTAAACGCGGT GATGGGAC GC ACGGAAAC GGGAGTGGAGAAAGT C ATGGAGAGAAC CC TA 428 

I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GTAAACGCGGT GATGGGAC GC AC GGAAAC GGGAGTGGAGAAAGT C ATGGAGAGAAC CC T A 2 99 

Qy 429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I II II I I I I I I I I I 
Db 300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

Qy 489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II M I I I I I II I I I I I I I I I I I I I I I I I 
Db 359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

Qy 549 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 

Qy 609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 47 9 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 4 
AAV08 87 9 

ID AAV08879 standard; cDNA; 736 BP. 
XX 

AC AAV08879; 
XX 

DT 20-MAR-2003 (revised) 

DT 25-FEB-1999 (first entry) 

XX 

DE UCP2 promoter. 
XX 

KW UCP2; promoter; transcription factor; modulator; diabetes; obesity; 

KW therapy; ds . 

XX 

OS Homo sapiens . 
XX 

PN US5849514-A. 
XX 

PD 15-DEC-1998. 
XX 

PF 19-JUN-1998.; 98US-00100297 . 
XX 

PR 25-APR-1997; 97US-00846012 . 
XX 

PA (TULA-) TULARIK INC. 
XX 

PI Chen J, Amaral MC; 
XX 

DR WPI; 1999-069722/06. 
XX 

PT Screening assay for modulators of UCP2 gene expression - based on 
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PT interaction of transcription factor and defined UCP2 promoter sequence. 
XX 

PS Claim 1; Col 7-8; 9pp; English. 
XX 

CC This sequence represents the UCP2 promoter, and is used in the method of 

CC the invention. The method is a screening assay for agents that modulate 

CC the effect of a transcription factor on a UCP2 promoter comprises 

CC combining the promoter and transcription factor in the presence and 

CC absence of a candidate agent and determining any change in the effect of 

CC the transcription factor on the promoter. The promoter comprises at least 

CC 50 nucleotides of a sequence comprising nucleotides 1-4 60 of the UCP2 

CC promoter. The method is used to identify agents that modulate UCP2 gene 

CC transcription (agents that upregulate UCP2 are potentially useful for 

CC treating diabetes and obesity) . (Updated on 20-MAR-2003 to correct PF 

CC field.) 
XX 

SQ Sequence 736 BP; 119 A; 236 C; 235 G; 146 T; 0 U; 0 Other; 

Query Match 76.1%; Score 48 5; DB 2; Length 736; 

Best Local Similarity 99.6%; Pred. No. 3.4e-99; 

Matches 507; Conservative 0; Mismatches 0; Indels 2; Gaps 2; 

Qy 129 GGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTGCACTTAAG 188 

I II I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I t I I I I I I I 
Db 1 GGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTGCACTTAAG 60 

Qy 189 ACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 248 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 

Qy 249 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 120 ACTGGACAAGACTTGTTTC TGGCGGTCAGTCT TGCCATCC TCAC AGAGGTTGGCGGCCCG 179 

Qy 309 AGAGAGT GTGAGGC AGAGGCGGGGAGTGGCAAGGGAGTGACC AT C TC GGGGAAC GAAGGA 368 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 AGAGAGT GTGAGGC AGAGGCGGGGAGTGGCAAGGGAGTGACCAT C TC GGGGAAC GAAGGA 239 

Qy 369 GTAAACGCGGT GATGGGAC GC ACGGAAACGGGAGT GGAGAAAGT C ATG GAGAGAAC C C T A 428 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GTAAACGCGGT GATGGGAC GCACGGAAAC GGGAGT GGAGAAAGT C ATGGAGAGAAC C C T A 2 99 

Qy 429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

Qy 489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 1 I I I I I 
Db 359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

Qy 549 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 
Db 419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 

Qy 609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 479 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 5 
AAV44599 
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ov/vyx\xi ocaiitii xvcauna i>cidiid iui rvppiivauun u70U7v/70 aiiu ocai^n xvcauii ua-v7-ow7... i "age 1 1 ui 



ID AAV44599 standard; DNA; 1161 BP. 
XX 

AC AAV44599; 
XX 

DT 24-NOV-1998 {first entry) 
XX 

DE Human uncoupling protein-2 UCP2 gene clone hUCP2-g2 sequence 2. 
XX 

KW Uncoupling protein-2; UCP2 gene; human; respiration; thermogenesis; 

KW obesity; hyperinsulinaemia; glucose intolerance; diabetes; syndrome X; 

KW hypothermia; wasting; cachexia; anorexia; inflammation; fever; 

KW hyperthermia; gene therapy; diagnosis; ds . 
XX 

OS Homo sapiens. 
XX 

PN W09831396-A1. 
XX 

PD 23-JUL-1998. 
XX 

PF 22-APR-1997; 97WO-US006864 . 
XX 

PR 15-JAN-1997; 97US-00349 60P . 
XX 

PA (UYDU-) UNIV DUKE. 

PA (REGC ) UNIV CALIFORNIA. 

PA (CNRS ) CENT NAT RECH SCI. 

XX 

PI Surwit RS, Collins SA, Warden CH, Seldin MF, Ricquier D; 

PI Bouillaud F; 

XX 

DR WPI; 1998-413823/35. 
XX 

PT Method for treating disease associated with altered UCP-2 expression - by 

PT administering agent which enhances or inhibits UCP-2 activity, 

PT effectively to treat obesity, diabetes, fever, hyperthermia, cachexia 

PT etc . 

XX 

PS Example II; Fig 10b; 98pp; English. 
XX 

CC This is the nucleotide sequence of a region (sequence 1) of the human 

CC uncoupling protein-2 (UCP2) gene present in genomic clone hUCP2-g2 (I- 

CC 1867). 4 Regions (see V44598-601) of hUCP2-g2 are provided. The hUCP2-g2 

CC clone was isolated from a human placenta genomic DNA library constructed 

CC in lambda EMBL3 phage using a partial genomic fragment as probe. Sequence 

CC 2 corresponds to DNA from positions bp -511 to +650 and includes the 

CC putative proximal human UCP2 promoter. The UCP2 gene maps to a 

CC chromosomal region (llql3) linked to obesity and hyperinsulinaemia. The 

CC invention provides methods for the treatment of disorders associated with 

CC diminished or elevated UCP2 expression or activity. An agent which 

CC enhances UCP2 expression (e.g. an expression construct comprising a UCP2 

CC encoding sequence) can be used to treat obesity, diabetes, syndrome X, 

CC hypothermia, hyperinsulinaemia, or glucose intolerance. An inhibitor of 

CC UCP2 (e.g. an antisense construct) is used to treat wasting, anorexia, 

CC inflammation, cachexia, fever or hyperthermia (all claimed) . The 

CC invention also relates to diagnostic and drug screening methodologies 

XX 

SQ Sequence 1161 BP; 209 A; 346 C; 323 G; 234 T; 0 U; 49 Other; 

Query Match 70.2%; Score 447.4; DB 2; Length 1161; 
Best Local Similarity 94.9%; Pred. No. 9.5e-91; 

Matches 516; Conservative 0; Mismatches 22; Indels 6; Gaps 5; 
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Qy 100 AGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGT — AGGGGGG 157 

I II III I I I I I I III I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I 
Db 2 ANGAACCACCGGCGCGTTCGTTCGCAGGAGGTTGGTTAGTTTGCCCAGGGGTAAGGGGGG 61 

Qy 158 C TGGGCC C ATAAAAG AGGAAGTGC - AC T TAAGAC ACGGC C CC GC TGGACGC T T G T TAGAA 216 

I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 62 C TGGGCC C ATAAAAGAGGAAGT GC AAC T TAAGAC AC GGC C C C C G T TGGACGC T GT TAGAA 121 

Qy 217 ACC GTCC T - GGC TGGGAAGGCAAGAGGTGT GT GAC TGGAC AAGAC TT GT T T C T - GGC GG T 274 

III I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I 
Db 122 ACCTTCCTGGGTTGGGAAGGCAAGAGGTGTGTGACTGGACAAGAATTGTTTCTGGGCGGT 181 

Qy 275 C AGTC T T GCC ATCC T C AC AGAGGT TGGC GGC C CGAGAGAG T GT GAGGC AGAGGC GGGGAG 334 

M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 CAGTCTT GCCATCCTCACAGAGGT TGGC GGCCCGAGAGAGTGTGAGGC AGAGGC GGGGAG 241 

Qy 335 TGGCAAGGGAGTGAC CATC TC GGGGAAC GAAGGAGT AAAC GC GG TGAT GGGAC GC ACGG- 393 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 242 TGGCAAGGGAGTGACCATC TCGGGGAAC GAAGGAGTAAAC GCGGTGAT GGGAC GCACGGA 301 

Qy 394 AAACGGGAGTGGAGAAAGTCATGGAGAGAACC CTAGGCGGGGCGGTCCCCGC GGAAAGGC 453 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I 1 
Db 302 AAACGGGAGTGGAGAAAGTCATGGAGAGAACC TTAGGCGGGGCGGTCCCCGC GGAAAGGC 361 

Qy 454 GGC TGCTCCAGGGTCTCCGCACCCAAGTAGGAGC TGGC AGGCCCGGCCCCGCCCCGCAGG 513 

I I I I I I I I I II I I I I I I I II I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I II I I 
Db 362 GGC TGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGC AGGCCCGGCCCCGCCCCGCAGG 421 

Qy 514 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 573 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 422 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 481 

Qy 574 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 633 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I II I I I I I I 
Db 482 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 541 

Qy 634 CGCG 637 

I I I I 

Db 542 CGCG 545 



RESULT 6 
ABL33892 

ID ABL33892 standard; DNA; 5283 BP. 
XX 

AC ABL33892; 
XX 

DT 26-MAR-2002 (first entry) 
XX 

DE Human immune system associated gene SEQ ID NO: 18 65. 
XX 

KW Human; immune system disease; cytosine methylation; antiasthmatic; 

KW antiarteriosclerotic; antianaemic; cytostatic; nootropic; 

KW neuroprotective; anti-HIV; anticonvulsant; ophthalmological; 

KW antirheumatic; antiarthritic; antidiabetic; antipsoriatic; 

KW antiinflammatory; cancer; eye disease; arteriosclerosis; anaemia; 

KW acute myeloid leukaemia; Alzheimer's disease; AIDS; epilepsy; 

KW neurofibromatosis; rheumatoid arthritis; psoriasis; bowel disease; gene; 

KW ds. 

XX 
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OS Homo sapiens . 
XX 

PN WO200200928-A2. 
XX 

PD 03-JAN-2002. 
XX 

PF 02-JUL-2001; 2001WO-EP007537 . 
XX 



PR 30-JUN-2000; 2000DE-01032 529 . 

PR 01-SEP-2000; 2000DE-01043826 . 
XX 

PA (EPIG-) EPIGENOMICS AG. 
XX 

PI Olek A, Piepenbrock C, Berlin K; 
XX 

DR WPI; 2002-130909/17. 
XX 

PT Nucleic acid comprising fragment of chemically modified gene, useful for 

PT diagnosis and treatment of diseases associated with abnormal cytosine 

PT methylation . 
XX 

PS Claim 1; SEQ ID NO 1865; 32pp + Sequence Listing; German. 
XX 

CC The present invention provides a number of human immune system associated 

CC genes which are modified by the methylation of cytosines. The sequences 

CC can be used in the diagnosis and treatment of immune system disorders, 

CC including eye diseases such as retinopathy, neovascular glaucoma and 

CC macular degeneration, arteriosclerosis, anaemia, cancer, acute myeloid 

CC leukaemia, Alzheimer's disease, AIDS, epilepsy, neurofibromatosis, 

CC rheumatoid arthritis, psoriasis and inflammatory/ulcerative bowel 

CC diseases. The present sequence is a gene of the invention 
XX 

SQ Sequence 5283 BP; 1097 A; 136 C; 1369 G; 2681 T; 0 U; 0 Other; 



Query Match 57.2%; Score 364.2; DB 6; Length 5283; 

Best Local Similarity 77.4%; Pred. No. 5.1e-72; 

Matches 467; Conservative 0; Mismatches 133; Indels 3; Gaps 2; 



Qy 35 CGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGC 94 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 

Db 2 CGATAACGCGGCGGTTGTAATTAATCGATAGCGAGGTCGGTCGCGAGGTTTTAGTTTCGT 61 

Qy 95 CCTGCAGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGG 154 

II I I I I I II I I II I I III III I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 62 TTTGTAGGAGTTAGTCGCGCGTTCGTTCGTAGGAGGGTGGGTAGTTTGTTTAGCGTA-GG 120 

Qy 155 GGGCTGGGCCC ATAAAAGAGGAAGTGCACTTAAGAC ACGGCCCC GC TGGACGC TTGT TAG 214 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 121 GGGT TGGGT T T ATAAAAGAGGAAGTGT AT T T AAGAT ACGGT T TAG — TGGAC GT TGT TAG 178 

Qy 215 AAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCTGGCGGT 274 

III III III I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 AAATCGTTTTGGTTGGGAAGGTAAGAGGTGTGTGATTGGATAAGATTTGTTTTTGGCGGT 238 

Qy 275 CAGTCTTGCCATCCTCACAGAGGT TGGCGGCC CGAGAGAGTGTGAGGC AGAGGCGGGGAG 334 

III III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 239 T AGT T T TGTT AT T TT T ATAGAGGT TGGC GGT T C GAGAGAGTGTGAGGT AGAGGC GGGGAG 298 

Qy 335 TGGCAAGGGAGTGAC CATC TC GGGGAAC GAAGGAGT AAAC GCGGTGAT GGGACGC ACGGA 394 

III I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 299 TGGTAAGGGAGTGATTATT TCGGGGAACGAAGGAGTAAAC GCGGTGATGGGACGT ACGGA 358 
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Qy 395 AAC GGGAGTGGAGAAAGT C ATGGAGAGAACC C T AGGC GGGGCGG TC CC C GC GGAAAGGCG 454 

II I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 359 AAC GGGAGT GGAGAAAGT T ATGGAGAGAAT T T TAGGCGGGGCGG T T T T C GC GGAAAGGCG 418 

Qy 455 GCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGC 514 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I 
Db 419 GTTGTTTTAGGGTTTTCGTATTTAAGTAGGAGTTGGTAGGTTCGGTTTCGTTTCGTAGGT 478 

Qy 515 CCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACT 574 

I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I II 

Db 479 TTTATTTCGGGTTTCGTTTTCGAGGTTTAAGTCGCGTCGTCGTTTGCGCGGAGTTTTATT 538 

Qy 575 GCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGAC 634 

I I I I I I II I I I I I I I I I I I I I I I I I I III III III III I I I I I I I I 
Db 539 GCGAAGTTTAGTTGCGCGCGTTTTGGGATTGATTGTTTACGTTCGTTCGGTTCGTTCGAC 598 

Qy 635 GCG 637 

I I I 

Db 599 GCG 601 



RESULT 7 
ABL33893/C 

ID ABL33893 standard; DNA; 5283 BP. 
XX 

AC ABL33893; 
XX 

DT 26-MAR-2002 (first entry) 
XX 

DE Human immune system associated gene SEQ ID NO: 18 66. 
XX 

KW Human; immune system disease; cytosine methylation; antiasthmatic; 

KW antiarteriosclerotic; antianaemic; cytostatic; nootropic; 

KW neuroprotective; anti-HIV; anticonvulsant; ophthalmological; 

KW antirheumatic; antiarthritic; antidiabetic; antipsoriatic; 

KW antiinflammatory; cancer; eye disease; arteriosclerosis; anaemia; 

KW acute myeloid leukaemia; Alzheimer's disease; AIDS; epilepsy; 

KW neurofibromatosis; rheumatoid arthritis; psoriasis; bowel disease; gene; 

KW ds. 

XX 

OS Homo sapiens . 
XX 

PN WO200200928-A2. 
XX 

PD 03-JAN-2002. 
XX 

PF 02-JUL-2001; 2001WO-EP007537 . 
XX 

PR 30-JUN-2000; 2000DE-01032529 . 

PR 01-SEP-2000; 2000DE-01043826. 
XX 

PA (EPIG-) EPIGENOMICS AG. 
XX 

PI Olek A, Piepenbrock C, Berlin K; 
XX 

DR WPI; 2002-130909/17. 
XX 

PT Nucleic acid comprising fragment of chemically modified gene, useful for 

PT diagnosis and treatment of diseases associated with abnormal cytosine 

PT methylation. 
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XX 

PS Claim 1; SEQ ID NO 1866; 32pp + Sequence Listing; German. 
XX 

CC The present invention provides a number of human immune system associated 

CC genes which are modified by the methylation of cytosines. The sequences 

CC can be used in the diagnosis and treatment of immune system disorders, 

CC including eye diseases such as retinopathy, neovascular glaucoma and 

CC macular degeneration, arteriosclerosis, anaemia, cancer, acute myeloid 

CC leukaemia, Alzheimer's disease, AIDS, epilepsy, neurofibromatosis, 

CC rheumatoid arthritis, psoriasis and inflammatory/ulcerative bowel 

CC diseases . The present sequence is a gene of the invention 
XX 

SQ Sequence 5283 BP; 1342 A; 136 C; 1475 G; 2330 T; 0 U; 0 Other; 

Query Match 4 9.0%; Score 312.4; DB 6; Length 5283; 

Best Local Similarity 72.0%; Pred. No. 2.1e-60; 

Matches 435; Conservative 0; Mismatches 166; Indels 3; Gaps 2; 

Qy 34 CCGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCG 93 

I I I I I I I I I I I II II I I I I I I I I I I I I I III III I I I I I I I I I I I I I I I I 
Db 5283 CCGACAACGCGACGACTATAACCAATCGACAACGAAACCGATCGCGAAACCCCAATCCCG 5224 

Qy 94 CCCTGCAGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGG 153 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 5223 C CC T ACAAAAAC C AAC CGC GCGC T C GC T CGC AAAAAAATAAATAAT T T ACC C AAC GTA- A 5165 

Qy 154 GGGGC TGGGC C C ATAAAAGAGGAAGTGC AC T T AAGAC AC GGC CC C GC T GGAC GC T T GT T A 213 

II I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I 

Db 5164 AAAAC TAAACC C ATAAAAAAAAAAAT AC AC T T AAAACAC G AC CC — AATAAACGC TAT T A 5107 

Qy 214 GAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTGACTGGACAAGACTTGTTTCTGGCGG 273 

I I I I II I I I I II II I I 1 I I I I I I I I I I I I I I I I I I I I II 
Db 5106 AAAACCGTCCTAACT AAAAAAACAAAAAATAT ATAAC TAAACAAAACT TATT TCTAACGA 5047 

Qy 274 T C AGTC T TGCC ATC C TCAC AGAGGT T GGCGGC CC GAGAGAGT GT GAGGCAGAGGCGGGGA 333 

I I I I I M I I I I I I I I I I I I II I I I I I II I I I I I III II I 
Db 5046 TCAATC T TACC ATCC TCACAAAAATTAACGACCCGAAAAAATATAAAACAAAAACGAAAA 4987 

Qy 334 GTGGCAAGGGAGTGACCATCTCGGGGAACGAAGGAGTAAACGCGGTGATGGGACGCACGG 393 

I III I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 4986 ATAACAAAAAAATAACC AT C TC GAAAAACGAAAAAATAAACGCGATAAT AAAAC GC ACGA 4927 

Qy 394 AAACGGGAGTGGAGAAAGTCATGGAGAGAACCC TAGGCGGGGCGGTCC CCGCGGAAAGGC 453 

I I I I I II I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I 
Db 4926 AAACGAAAATAAAAAAAAT C ATAAAAAAAAC C C TAAAC GAAACGATC C C C GC GAAAAAAC 4867 

Qy 454 GGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGG 513 

I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I 
Db 4866 GACTACTCCAAAATCTCCGCACCCAAATAAAAACTAACAAACCCGACCCCGCCCCGCAAA 4807 

Qy 514 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 573 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4806 CCCCACCCCGAACCCCGCCCCCGAAACTTAAACCGCGCCGCCGCCTACGCGAAACCCCAC 4747 

Qy 574 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 633 

I II I I I I II II I I I I I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I 
Db 474 6 TAC GAAACCC AACTACGCGCGC C T TAAAAT TAAC TAT CC ACGC T C GC C CGAC TC GT CC GA 4687 

Qy 634 CGCG 637 

I I I I 

Db 4686 CGCG 4683 
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ov^v^xnjj, ocaivn lvc^uiid lycuuia iui rvpjjuvauuii aiiu ocaivn ivcauit ua-u7-ou7... A " a 5 c AW ^ A 



RESULT 8 
ADP64964 

ID ADP64964 standard; DNA; 1646 BP. 
XX 

AC ADP64964; 
XX 

DT 12-AUG-2004 (first entry) 
XX 

DE Human uncoupling protein 2 (mitochondrial, proton carrier) DNA sequence. 
XX 

KW autoimmune disease; arthritide; gene expression analysis; 

KW rheumatoid arthritis; collagen- induced; immunosuppressive; antirheumatic; 

KW antiarthritic; osteopathic; antigout; antiinflammatory; derma tological; 

KW immunomodulatory; lupus; ankylosing spondylitis; Fibrositis; 

KW fibromyalgia; osteoarthritis; gout; juvenile rheumatoid arthritis; 

KW immune; ds; human. 

XX 

OS Homo sapiens . 
XX 

PN WO2003072827-A1. 
XX 

PD 04-SEP-2003. 
XX 

PF 31-OCT-2002; 2002WO-US035433 . 
XX 

PR 31-OCT-2001; 2001US-0336220P. 
XX 

PA (CHIL-) CHILDREN'S HOSPITAL MEDICAL CENT. 
XX 

PI Hirsch R, Thorton SL; 
XX 

DR WPI; 2003-712740/67. 

DR GENBANK; NM_003355. 
XX 

PT Diagnosing and analyzing autoimmune disease using gene expression 

PT profiles and microarray technology, useful for diagnosing and treating 

PT rheumatoid arthritis, lupus, fibrositis, osteoarthritis, fibromyalgia and 

PT gout . 

XX 

PS Disclosure; Page; 56pp; English. 
XX 

CC The invention relates to a novel method for diagnosing and analysing 

CC autoimmune disease or arthritides. The method comprises obtaining a 

CC patient sample containing mRNA, analysing gene expression using the mRNA 

CC that results in a gene expression signature of the mRNA, and using that 

CC gene expression signature to diagnose or analyse the autoimmune disease 

CC or arthritides in the patient, where gene expression of at least 60% of 

CC the genes correlates with that of the gene signature. The invention 

CC further comprises: a treatment of rheumatoid arthritis; identification of 

CC genes for targeting in the treatment of rheumatoid arthritis in a mammal 

CC other than a mouse; diagnosis of rheumatoid arthritis in a mammal; an 

CC array or gene chip, specific for rheumatoid arthritis; diagnosis or 

CC analyses of autoimmune disease or rheumatoid arthritis; screening the 

CC efficacy of a candidate drug in vitro for the treatment of collagen- 

CC induced arthritis; and reducing the symptoms associated with collagen- 

CC induced arthritis. The compositions of the invention have the following 

CC activities: immunosuppressive, antirheumatic, antiarthritic, osteopathic, 

CC antigout, antiinflammatory, dermatological, and immunomodulatory. The 

CC methods and compositions of the present invention are useful for 
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CC diagnosing and treating autoimmune disease or arthritides, such as 

CC rheumatoid arthritis, lupus, ankylosing spondylitis, fibrositis, 

CC fibromyalgia, osteoarthritis, gout, juvenile rheumatoid arthritis, and an 

CC immune disease caused by an infectious agent. This polynucleotide 

CC represents a DNA sequence relating to the genes used in the analysis and 

CC treatment of autoimmune diseases or arthritides. Note: This sequence is 

CC not shown in the specification. It has been supplied in an electronic 

CC format from WIPO. 

XX 

SQ Sequence 1646 BP; 309 A; 535 C; 405 G; 397 T; 0 U; 0 Other; 

Query Match 10.5%; Score 67; DB 11; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 1.9e-05; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I 11 I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 

Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 9 
ADJ75174 

ID ADJ7 5174 standard; DNA; 1646 BP. 
XX 

AC ADJ7 5174; 
XX 

DT 20-MAY-2004 (first entry) 
XX 

DE Marker gene SEQ ID NO: 426. 
XX 

KW bronchial asthma; chronic obstructive pulmonary disease; 

KW respiratory epithelial cell; interleukin-13; respiratory; antiasthmatic; 

KW gene therapy; marker gene; gene; ds . 

XX 

OS Homo sapiens . 
XX 

PN EP1394274-A2. 
XX 

PD 03-MAR-2004. 
XX 

PF 04-AUG-2003; 2003EP-00254857 . 
XX 

PR 06-AUG-2002; 2002 JP-00229312 . 

PR 20-MAR-2003; 2003JP-00077212 . 
XX 

PA (GENO-) GENOX RES INC. 
XX 

PI Ohtani N, Sugita Y, Yamaya M, Kubo H, Nagai H, Izuhara K; 
XX 

DR WPI; 2004-193155/19. 
XX 

PT Testing for bronchial asthma or chronic obstructive pulmonary disease by 

PT comparing the expression level of a marker gene in a biological sample 

PT from a subject with the expression level of the gene in a sample from a 

PT healthy subject. 
XX 

PS Claim 1; SEQ ID NO 426; 241pp; English. 
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XX 

CC The present invention describes a method of testing for bronchial asthma 

CC or chronic obstructive pulmonary disease. The method comprises 

CC determining the expression level of a marker gene in a biological sample 

CC from a subject, comparing the expression level determined with the 

CC expression level of the marker gene in a biological sample from a healthy 

CC subject, and judging whether the subject has bronchial asthma or chronic 

CC obstructive pulmonary disease. The marker gene comprises: (a) a group of 

CC genes (SI) whose expression levels increase when respiratory epithelial 

CC cells are stimulated with interleukin-13; or (b) a group of genes (S2) 

CC whose expression levels decrease when respiratory epithelial cells are 

CC stimulated with interleukin-13. Also described: (1) a reagent (I) for 

CC testing for bronchial asthma or chronic obstructive pulmonary disease; 

CC (2) a kit for screening for a candidate compound for a therapeutic agent 

CC to treat bronchial asthma or chronic obstructive pulmonary disease; (3) 

CC an animal model for bronchial asthma or chronic obstructive pulmonary 

CC disease; (4) an inducer that induces bronchial asthma in a mouse; (5) a 

CC method for producing an animal model for bronchial asthma or chronic 

CC obstructive pulmonary disease; (6) a therapeutic agent for bronchial 

CC asthma or chronic obstructive pulmonary disease, comprising the compound, 

CC a marker gene or an antisense nucleic acid corresponding to a portion of 

CC the marker gene, a ribozyme, a polynucleotide that suppresses the 

CC expression of the gene through an RNAi effect or an antibody recognising 

CC a protein encoded by a marker gene; and (7) a DNA chip for testing for 

CC bronchial asthma or a chronic obstructive pulmonary disease, on which a 

CC probe has been immobilised to assay a marker gene. (I) has respiratory 

CC and antiasthmatic activities, and can be used in gene therapy. The method 

CC is useful for testing for or screening for a therapeutic agent for 

CC bronchial asthma or chronic obstructive pulmonary disease. The present 

CC sequence is used in the exemplification of the present invention. 

XX 

SQ Sequence 164 6 BP; 309 A; 535 C; 405 G; 397 T; 0 U; 0 Other; 

Query Match 10.5%; Score 67; DB 12; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 1.9e-05; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I 11 I I I I I i I I I I I I II I I I I I I I I I I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 



Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 10 
ADY16091 

ID ADY16091 standard; DNA; 1646 BP. 
XX 

AC ADY16091; 
XX 

DT 05-MAY-2005 (first entry) 
XX 

DE DNA encoding a PRO polypeptide, SEQ ID NO 1897. 
XX 

KW Antiinflammatory; Immune disorder; Dermatological; Immunosuppressive; 

KW Antirheumatic; Antiarthritic; Osteopathic; Hemostatic; Antianemic; 

KW Antithyroid; Antidiabetic; Nephrotropic; CNS-Gen.; Hepatotropic; 

KW Virucide; Gastrointestinal-Gen.; Antipsoriatic; Antiasthmatic; 

KW Antiallergic; ds; gene; diagnosis. 
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XX 

OS Homo sapiens . 
XX 

PN WO2005016962-A2. 
XX 

PD 24-FEB-2005. 
XX 

PF ll-AUG-2004; 2004WO-US026249 . 
XX 

PR ll-AUG-2003; 2003US-0493546P. 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Abbas A, Clark H, Ouyang W, Williams MP, Wood WI, Wu TD; 
XX 

DR WPI; 2005-182330/19. 
XX 

PT New nucleic acid encoding PRO polypeptide, useful for diagnosing and 

PT treating an immune related disorder, e.g. systemic lupus erythematosus, 

PT rheumatoid arthritis, osteoarthritis, thyroiditis, or diabetes mellitus. 
XX 

PS Claim 1; SEQ ID NO 1897; 158pp; English. 
XX 

CC The invention relates to an isolated nucleic acid encoding a PRO 

CC polypeptide. The polypeptide, agonist or an antagonist, antibody, 

CC composition, and method are useful for diagnosing and treating an immune 

CC related disorder, e.g. systemic lupus erythematosus, rheumatoid 

CC arthritis. The present sequence represents a DNA encoding a PRO 

CC polypeptide. 

XX 

SQ Sequence 164 6 BP; 309 A; 535 C; 405 G; 397 T; 0 U; 0 Other; 



Query Match 10.5%; Score 67; DB 14; Length 1646; 

Best Local Similarity 100.0%; Pred. No. 1.9e-05; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I 1 I I I I I I I I I I I I I I I I I t I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 



Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 11 
ADP65417 

ID ADP65417 standard; DNA; 8177 BP. 
XX 

AC ADP65417; 
XX 

DT 12-AUG-2004 (first entry) 
XX 

DE Human uncoupling protein-2 (UCP2) gene, nuclear gene DNA. 
XX 

KW autoimmune disease; arthritide; gene expression analysis; 

KW rheumatoid arthritis; collagen-induced; immunosuppressive; antirheumatic; 

KW antiarthritic; osteopathic; antigout; antiinflammatory; derma tological; 

KW immunomodulatory; lupus; ankylosing spondylitis; Fibrositis; 

KW fibromyalgia; osteoarthritis; gout; juvenile rheumatoid arthritis; 

KW immune; ds; human. 
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XX 

OS Homo sapiens . 
XX 

PN WO2003072827-A1 . 
XX 

PD 04-SEP-2003. 
XX 

PF 31-OCT-2002; 2002WO-US035433 . 
XX 

PR 31-OCT-2001; 2001US-0336220P. 
XX 

PA (CHIL-) CHILDREN'S HOSPITAL MEDICAL CENT. 
XX 

PI Hirsch R, Thorton SL; 
XX 

DR WPI; 2003-712740/67. 

DR GENBANK; AF096289. 
XX 

PT Diagnosing and analyzing autoimmune disease using gene expression 

PT profiles and microarray technology, useful for diagnosing and treating 

PT rheumatoid arthritis, lupus, fibrositis, osteoarthritis, fibromyalgia and 

PT gout . 

XX 

PS Disclosure; Page; 56pp; English. 
XX 

CC The invention relates to a novel method for diagnosing and analysing 

CC autoimmune disease or arthritides. The method comprises obtaining a 

CC patient sample containing mRNA, analysing gene expression using the mRNA 

CC that results in a gene expression signature of the mRNA, and using that 

CC gene expression signature to diagnose or analyse the autoimmune disease 

CC or arthritides in the patient, where gene expression of at least 60% of 

CC the genes correlates with that of the gene signature. The invention 

CC further comprises: a treatment of rheumatoid arthritis; identification of 

CC genes for targeting in the treatment of rheumatoid arthritis in a mammal 

CC other than a mouse; diagnosis of rheumatoid arthritis in a mammal; an 

CC array or gene chip, specific for rheumatoid arthritis; diagnosis or 

CC analyses of autoimmune disease or rheumatoid arthritis; screening the 

CC efficacy of a candidate drug in vitro for the treatment of collagen- 

CC induced arthritis; and reducing the symptoms associated with collagen- 

CC induced arthritis. The compositions of the invention have the following 

CC activities: immunosuppressive, antirheumatic, antiarthritic, osteopathic, 

CC antigout, antiinflammatory, derma tological, and immunomodulatory. The 

CC methods and compositions of the present invention are useful for 

CC diagnosing and treating autoimmune disease or arthritides, such as 

CC rheumatoid arthritis, lupus, ankylosing spondylitis, fibrositis, 

CC fibromyalgia, osteoarthritis, gout, juvenile rheumatoid arthritis, and an 

CC immune disease caused by an infectious agent. This polynucleotide 

CC represents a DNA sequence relating to the genes used in the analysis and 

CC treatment of autoimmune diseases or arthritides. Note: This sequence is 

CC not shown in the specification. It has been supplied in an electronic 

CC format from WIPO. 

XX 

SQ Sequence 8177 BP; 1708 A; 2261 C; 2020 G; 2188 T; 0 U; 0 Other; 

Query Match 10.5%; Score 67; DB 11; Length 8177; 

Best Local Similarity 100.0%; Pred. No. 2.3e-05; 

Matches 67; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 

I I I I I I I I II I I I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 
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Qy 631 CGACGCG 637 

I I I I I I I 
Db 61 CGACGCG 67 



RESULT 12 
AAS94928 

ID AAS94928 standard; DNA; 1643 BP. 
XX 

AC AAS94928; 
XX 

DT 14-FEB-2002 (first entry) 
XX 

DE Human DNA sequence #183 expressed during foam cell differentiation. 
XX 

KW Human; foam cell differentiation; atherosclerosis; cerebral stroke; 

KW cardiovascular disorder; coronary artery disease; gene therapy; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200177389-A2. 
XX 

PD 18-OCT-2001. 
XX 

PF 04-APR-2001; 2001WO-US011128 . 
XX 

PR 05-APR-2000; 2000US-0195106P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Shiffman D, Somogyi R, Lawn R, Seilhamer JJ, Porter GJ, Mikita T; 

PI Tai J; 

XX 

DR WPI; 2002-010925/01. 
XX 

PT Composition useful for diagnosis of conditions, disorders or diseases 

PT associated with atherosclerosis, comprises several polynucleotides that 

PT are differentially expressed in foam cell development. 
XX 

PS Claim 1; Page 238; 315pp; English. 
XX 

CC The present invention relates to the isolation of human polynucleotide 

CC sequences that are differentially expressed during foam cell 

CC differentiation. The polynucleotide sequences of the invention or a 

CC composition comprising these polynucleotides are useful as a high 

CC throughput method for detecting altered expression of one or more 

CC polynucleotides in a sample. The polynucleotides can be used in the 

CC diagnosis of disorders associated with foam cell development such as 

CC atherosclerosis, cerebral stroke, and cardiovascular disorders such as 

CC coronary artery disease. The polynucleotide sequences can also be used as 

CC PCR primers and probes. The polynucleotides of the invention are also 

CC useful in gene therapy. AAS94746-AAS95021 represent the human 

CC polynucleotide sequences of the invention which are differentially 

CC expressed during foam cell differentiation 

XX 

SQ Sequence 1643 BP; 311 A; 533 C; 402 G; 397 T; 0 U; 0 Other; 

Query Match 9.1%; Score 58; DB 6; Length 164 3; 
Best Local Similarity 100.0%; Pred. No. 0.002; 

Matches 58; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
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Qy 580 GCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 
Db 1 GCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 58 



RESULT 13 
AAC10386 

ID AAC10386 standard; cDNA; 161 BP. 
XX 

AC AAC10386; 
XX 

DT 06-OCT-2000 (first entry) 
XX 

DE Human secreted protein 5' EST, SEQ ID NO: 14461. 
XX 

KW Human; 5 ! EST; expressed sequence tag; secreted protein; cDNA isolation; 

KW gene therapy; chromosome mapping; ss. 

XX 

OS Homo sapiens. 
XX 

PN EP1033401-A2. 
XX 

PD 06-SEP-2000. 
XX 

PF 21-FEB-2000; 2000EP-00200610 . 
XX 

PR 26-FEB-1999; 99US-0122487P. 
XX 

PA (GEST ) GENSET. 
XX 

PI Dumas Milne Edwards J, Duclert A, Giordano J; 
XX 

DR WPI; 2000-500381/45. 
XX 

PT New nucleic acid that is a 5 1 expressed sequence tag (5 1 EST) for 

PT obtaining cDNAs and genomic DNAs that correspond to 5' ESTs and for 

PT diagnostic, forensic, gene therapy and chromosome mapping procedures. 
XX 

PS Claim 1; SEQ ID NO 14461; 71pp + Sequence Listing; English. 
XX 

CC The present sequence is one of a large number of 5* ESTs derived from 

CC mRNAs encoding secreted proteins. No ORF has yet been conclusively 

CC identified within the present sequence. The 5* ESTs were prepared from 

CC total human RNAs or polyA+ RNAs derived from 30 different tissues. EST 

CC sequences usually correspond mainly to the 3' untranslated region (UTR) 

CC of the mRNA because they are often obtained from oligo-dT primed cDNA 

CC libraries. Such ESTs are not well suited for isolating cDNA sequences 

CC derived from the 5' ends of mRNAs and even in those cases where longer 

CC cDNA sequences have been obtained, the full 5 1 UTR is rarely included. 5* 

CC ESTs are derived from mRNAs with intact 5' ends and can therefore be used 

CC to obtain full length cDNAs and genomic DNAs. 5 1 ESTs are also used in 

CC diagnostic, forensic, gene therapy and chromosome mapping procedures. 

CC They are used to obtain upstream regulatory sequences and to design 

CC expression and secretion vectors 
XX 

SQ Sequence 161 BP; 25 A; 64 C; 41 G; 30 T; 0 U; 1 Other; 

Query Match 8.9%; Score 57; DB 3; Length 161; 
Best Local Similarity 96.6%; Pred. No. 0.0026; 

Matches 57; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 
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Qy 57 9 AGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I : I I I I I II I I I I I I I I I I I I I I I I I I I I I I i II I I I I I I I I I I I II I I I I I I I I I I I 
Db 9 AASCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 67 



RESULT 14 
ADB87421 

ID ADB8 7421 standard; DNA; 2500 BP. 
XX 

AC ADB87421; 
XX 

DT 04-DEC-2003 (first entry) 
XX 

DE Transgene expression regulatory element, STAR66F. 
XX 

KW gene transcription; regulatory; variety fragment; STAR element; 

KW transgene expression; ds . 

XX 

OS Unidentified. 
XX 

PN WO2003004704-A2 . 
XX 

PD 16-JAN-2003. 
XX 

PF 14-JUN-2002; 2002WO-NL000390 . 
XX 

PR 04-JUL-2001; 2001EP-00202581 . 

PR 05-JUL-2001; 2001US-0303199P . 
XX 

PA (CHRO-) CHROMAGENICS BV. 
XX 

PI Otte AP, Kruckeberg AL; 
XX 

DR WPI; 2003-229412/22. 
XX 

PT Selecting a DNA sequence with a gene transcription modulating quality by 

PT providing a transcription system with a variety fragment -comprising 

PT vectors and performing a selection step in the transcription system. 
XX 

PS Claim 43; Fig 26; 216pp; English. 
XX 

CC The invention relates to DNA sequences with gene transcription regulatory 

CC qualities and methods for the detection and use of the regulatory DNA 

CC sequences. The invention further comprises providing a transcription 

CC system with a variety fragment-comprising vectors; and performing a 

CC selection step in the transcription system in order to identify a 

CC fragment comprising the DNA sequence with the gene transcription 

CC modulating activity. This polynucleotide represents a STAR element used 

CC in the method of the invention. The STAR element is shown to improve 

CC transgene expression. 

XX 

SQ Sequence 2500 BP; 295 A; 902 C; 981 G; 322 T; 0 U; 0 Other; 

Query Match 8.1%; Score 51.8; DB 10; Length 2500; 
Best Local Similarity 52.0%; Pred. No. 0.052; 

Matches 116; Conservative 0; Mismatches 107; Indels 0; Gaps 0; 

Qy 374 CGC GGTG ATGGGACGCAC GGAAAC GGGAGTGGAGAAAGTC ATGGAGAGAACC C TAGGCGG 433 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1156 CGC GC GC CGGGACGGGAC GGGACGGGGC GGGGCGGGGCGGGAC GAGACGGGGCGGGGCGG 1215 
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434 



GGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAG 493 



Db 



1216 



GGCGGGCCGGGCAGCCTCCGGGCGGCGCGGCGCGGGCGGCGGCCGGATCCAGGGCGGGGG 1275 



494 



GCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCG 553 



Db 



1276 



TCGGCGGCCCGGCCAGCCCGGCCCGGCCCGGGGCCGCGTCCTGAGAGTCAGCCCTCGCCG 1335 



Qy 



554 



CCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCC 596 



Db 



1336 



CTGCAGCCTCGGCGCCCGGCCGGCCGGCCATGGAGCGCCCCCC 1378 



RESULT 15 
ADJ35230 

ID ADJ35230 standard; DNA; 2500 BP. 
XX 

AC ADJ35230; 
XX 

DT 22-APR-2004 (first entry) 
XX 

DE Human stabilising anti-repression, STAR, element #66. 
XX 

KW STAR affiliated proteinaceous molecule; post translational modification; 

KW human; stabilising anti-repression; STAR; STAR element; ds . 

XX 

OS Homo sapiens . 
XX 

PN WO2003106674-A2 . 
XX 

PD 24-DEC-2003. 
XX 

PF 30-MAY-2003; 2003WO-NL000410 . 
XX 

PR 14-JUN-2002; 2002EP-00077344 . 
XX 

PA (CHRO-) CHROMAGENICS BV. 
XX 

PI Otte AP, Kruckeberg AL, Satijn DPE; 
XX 

DR WPI; 2004-082195/08. 
XX 

PT Producing proteinaceous molecules in cells by selecting a cell, providing 

PT a nucleic acid encoding a proteinaceous molecule with an STabilizing Anti 

PT -Repression sequence and expressing proteinaceous molecule. 
XX 

PS Example 1; SEQ ID NO 66; 177pp; English. 
XX 

CC The invention relates to a method of producing a proteinaceous molecule 

CC (I) in a cell comprising selecting a cell for its suitability for 

CC producing (I), providing a nucleic acid encoding (I) with a nucleic acid 

CC comprising a STabilising Anti-Repression (STAR) sequence, expressing the 

CC resulting nucleic acid in the cell and collecting (I) . The method is 

CC useful for producing (I) . A cell line (II) provided with a nucleic acid 

CC comprising a STAR sequence Is useful for producing (I) . (II) Enables 

CC production of affiliated proteinaceous molecule, as cell carries out 

CC proper post-translational modifications of produced proteins. The present 

CC sequence represents a human stabilising anti-repression, STAR, element. 

XX 

SQ Sequence 2500 BP; 295 A; 903 C; 980 G; 322 T; 0 U; 0 Other; 
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Query Match 8.1%; Score 51.8; DB 12; Length 2500; 

Best Local Similarity 52.0%; Pred. No. 0.052; 

Matches 116; Conservative 0; Mismatches 107; Indels 0; Gaps 0; 

Qy 374 CGC GGTGATGGGACGCACGGAAAC GGGAGTGGAGAAAGTC ATGGAGAGAACCC TAGGCGG 433 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1156 CGC GCGC CGGGACGGGACGGGAC GGGGC GGGGCGGGGC GGGACGAGACGGGGC GGGGC GG 1215 

Qy 434 GGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAG 4 93 

I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1216 GGCGGGCCGGGCAGCCTCCGGGCGGCGCGGCGCGGGCGGCGGCCGGATCCAGGGCGGGGG 1275 

Qy 494 GCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCG 553 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 127 6 TCGGCGGCCCGGCCAGCCCGGCCCGGCCCGGGGCCGCGTCCTGAGAGTCAGCCCTCGCCG 1335 

Qy 554 CCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCC 596 

III I I I I I I I I II III I It I I I I I 

Db 1336 CTGCAGCCTCGGCGCCCGGCCGGCCGGCCATGGAGCGCCCCCC 1378 
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Gapop 10.0 , Gapext 1.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 12732272 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



GenEmbl : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 



gb_env : * 
gb_jpat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 
gb_sts : * 
gb_sy : * 
gb_un : * 
gb_vi : * 
gb_o v : * 
gb_htg : * 
gb_in: * 
gb_om : * 
gb_ba : * 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
E54511 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



E54511 3505 bp DNA linear PAT 31-JAN-2002 

UCP-2 promoter and use thereof. 

E54511 

E54511.1 GI:18629692 
JP 2000236886-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 (bases 1 to 3505) 

Toyota, Y . , Kobayashi,M. and Igaki,S. 
UCP-2 promoter and use thereof 
Patent: JP 2000236886-A 1 05-SEP-2000; 
TAKEDA CHEM IND LTD 
OS 
PN 
PD 
PF 
PR 
PI 
PC 



Homo sapiens (human) 
JP 2000236886-A/l 
05-SEP-2000 

22-DEC-1999 JP 1999364724 



YUKIO TOYOTA, MAKOTO KOBAYASHI , SHIGERU IGAKI 

C12N15/09,A61K45/00,A61P3/04, A61P3/06, A61P3/10, A61P9/12, PC 
A61P29/00,C12N1/21, 

PC C12N5/10,C12Q1/02,G01N33/15,G01N33/50//A61K31/711,A61K38/00, 
A61K48/00, 

(C12N15/09,C12R1:19) , (C12N15/09, C12R1 : 91) , (C12N1/21, C12R1 : 19) , 
(C12N5/10,C12R1:91) , C12N15/00, C12N5/00, A61K37/02, (C12N15/00, 
C12R1:19) , 

(C12N15/00,C12R1:91) , (C12N5/00, C12R1 : 91 ) 



PC 
PC 
PC 
PC 
PC 
CC 
FH 
FT 
FT 



FEATURES 

source 



ORIGIN 



Key Location/Qualifiers 
source 1. .3505 

/organism='Homo sapiens 

Location/Qualifiers 

1. .3505 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db xref="taxon:9606" 



(human) 



Query Match 100.0%; Score 637; DB 2; Length 3505; 

Best Local Similarity 100.0%; Pred. No. 1.8e-169; 

Matches 637; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1634 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 1693 



Qy 

Db 

Qy 

Db 



61 GAC AGC GAGGC CGGT C GCGAGGC C C C AG T C C C GC C C TGCAGGAGC C AGCCGC GC GC TC GC 120 

I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1694 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 1753 



121 



180 



TCGCAGGAGGGTGGGTAGT T TGCCCAGC GTAGGGGGGC TGGGCC CATAAAAGAGGAAGTG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
1754 TCGCAGGAGGGTGGGTAGT TTGCCCAGCGTAGGGGGGCTGGGCC CATAAAAGAGGAAGTG 1813 



Qy 181 C AC T T AAGAC ACGGC C C CGC T GGACGC T T GT T AGAAACCGT CC T GGCT GGGAAGGC AAGA 240 

I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1814 C AC T TAAGACACGGC C CCGC TGGACGC T T GT T AGAAAC CG T C C T GGCT GGGAAGGC AAGA 1873 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 
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x* age t ui t*y 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1874 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 1933 

301 GCGGCCC GAGAGAGT GTGAGGC AGAGGC GGGGAGT GGCAAGGGAGT GAC CAT C T C GGGGA 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I 
1934 GCGGCCCGAGAGAGT GTGAGGCAGAGGC GGGGAGT GGCAAGGGAGTGACC AT C TCGGGGA 1993 

361 ACGAAGGAGTAAACGC GGT GAT GGGAC GC AC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 420 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1994 ACGAAGGAGTAAACGC GGT GAT GGGAC GC AC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 2053 



421 



480 



GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
2054 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 2113 



481 



540 



TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
2114 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 2173 



541 



600 



TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
2174 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 2233 



601 



637 



GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2234 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 2270 



RESULT 2 
DQ087219 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



misc feature 



DQ087219 12177 bp DNA linear PRI 18-JUN-2005 

Homo sapiens uncoupling protein 2 (mitochondrial, proton carrier) 
(UCP2) gene, complete cds; nuclear gene for mitochondrial product. 
DQ087219 

DQ087219.1 GI: 67515418 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 (bases 1 to 12177) 

Livingston, R. J. , Rieder,M.J., Shaffer, T., Bertucci,C, Baier,C.N., 
Rajkumar,N., Willa,H.T., Daniels, M., Downing, T .K. , Stanaway, I . B . , 
Nguyen, C. P. , Gilder sleeve, H. , Cassidy, C ,M. , Johnson, E.J. , 
Swanson, J.E. , McFarland, I . , Yool,B., Park,C. and Nickerson, D . A. 
Direct Submission 

Submitted ( 07-JUN-2005 ) Genome Sciences, University of Washington, 
1705 NE Pacific, Seattle, WA 98195, USA 

To cite this work please use: NIEHS-SNPs, Environmental Genome 
Project, NIEHS ES15478, Department of Genome Sciences, Seattle, WA 
(URL: http://egp.gs.washington.edu) . 
Location/Qualifiers 
1. .12177 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon:9606" 
33. .342 

/rpt_family-"Alu" 
/rpt_type=dispersed 
618. .723 



repeat_region 
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variation 

repeat_region 

variation 

variation 

variation 

gene 
mRNA 

variation 

variation 

repeat_region 
variation 

variation 

variation 

variation 

variation 

repeat_region 
variation 



/note="Region not scanned for variation" 
866 

/frequency="0.01" 

/replace="a" 

989. .1075 

/rpt_family="MIR" 

/ rpt_t ype=di sper s ed 

1135 

/frequency="0.41" 

/replace="a" 

1151 

/frequency="0 .04" 

/replace="a" 

1960 

/frequency="0.03" 
/replace="t" 
2001. .10177 
/gene="UCP2" 

join(2001. .2124,3213. .3369,6369. .6593,6750. .6960, 
7829. .8023,8104. .8205,9175. .9355,9727. .10177) 
/gene="UCP2 " 

/product-"uncoupling protein 2 (mitochondrial, proton 

carrier) " 

2052 

/gene="UCP2" 
/frequency="0.04" 
/replace="t" 
2089 

/gene="UCP2" 

/frequency="0.06" 

/replace="g" 

2242. .2540 

/ r p t_f amily= " Alu " 

/rpt_type=dispersed 

2469 

/gene="UCP2 " 
/frequency="0.48" 
/replace="c" 
2818 

/gene="UCP2" 
/frequency="0.02" 
/replace="t" 
2900 

/gene="UCP2" 
/frequency="0 .06" 
/replace="t" 
3310 

/gene="UCP2" 

/f requency=" 0.05" 

/replace="t" 

3473 

/gene="UCP2" 

/frequency="0.07" 

/replace="g" 

3574. .3699 

/rpt_family="MIR" 

/rpt_type=dispersed 

3856 

/gene="UCP2" 

/f requency=" 0.98" 

/replace=" " 
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variation 

variation 

variation 

variation 

repeat_region 
misc_f eature 
repeat_region 
repeat_region 
variation 

variation 

variation 

variation 

CDS 



variation 



3957 

/gene="UCP2" 
/f requency="0 .02" 
/replace="g" 
3999 

/gene="UCP2" 
/frequency="0.02" 
/replace="a" 
4047 

/gene="UCP2" 
/frequency="0 . 01" 
/replace="g" 
4117 

/gene="UCP2 " 
/ f r equency= "0.10" 
/replace="g" 
4180. .4254 
/rpt_family="MIR" 
/rpt_type=dispersed 
4388. .5633 
/gene="UCP2" 

/note="Region not scanned for variation" 
5001. .5272 
/rpt_family="Alu" 
/rpt_type=dispersed 
5273. .5581 
/rpt_family="Alu" 
/rpt_type=dispersed 
5955 

/gene="UCP2" 
/f requency="0 . 02 " 
/replace="g" 
5972 

/gene="UCP2 " 
/frequency=" 0 . 01" 
/replace="a" 
6328 

/gene="UCP2" 
/ f r equency= " 0 . 
/replace="a" 
6374 

/gene="UCP2" 
/ f requency=" 0 . 
/ replace="a" 

join(6468. .6593,6750. .6960,7829. .8023,8104. .8205, 
9175. .9355,9727. .9841) 
/gene-"UCP2 " 
/codon_start=l 

/product="uncoupling protein 2 (mitochondrial, proton 
carrier) " 

/protein_id="AAY68217 . 1" 
/db_xref="GI : 67515419" 

/translation="MVGFKATDVPPTATVKFLGAGTAACIADLITFPLDTAKVRLQIQ 
GESQGPVRATAS AQYRGVMGT I LTMVRTEGPRS LYNGLVAGLQRQMS FAS VRIGLYDS 
VKQ F YT KG SEHAS I GS RL LAG S TTGALAVAVAQ P T DWKVR FQAQARAGGGRR Y Q S TV 
NAYKT IAREEGFRGLWKGTS PNVARNAI VNCAE LVTYDLIKDALLKANLMTDDL PCHF 
TSAFGAGFCTTVIAS PVDWKTRYMNSALGQYS SAGHCALTMLQKEGPRAFYKGFMPS 
FLRLGSWNWMFVT YEQ LKRALMAAC T S REAP F " 
6779 

/gene="UCP2" 
/frequency="0.01" 



.01" 



,01" 
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variation 



variation 



repeat_region 



variation 



repeat_region 



variation 



repeat_region 



variation 



variation 



variation 



variation 



variation 



variation 



/replace="t" 
6787 

/gene="UCP2" 
/frequency="0 . 46" 
/replace="t" 
6850 

/gene-"UCP2 " 
/frequency="0 .02" 
/replace="a" 
7113. .7334 
/rpt_family="MIR" 
/ rpt_type=dispersed 
7357 

/gene="UCP2 " 

/f requency=" 0.02" 

/replace=" " 

7361. .7480 

/rpt_family="MIR" 

/rpt_type=dispersed 

7435 

/gene-"UCP2 " 

/f requency="0 . 01" 

/replace="c" 

7486. .7570 

/rpt_family="MIR" 

/rpt_type=dispersed 

7566 

/gene="UCP2 " 
/frequency="0.02" 
/replace="g" 
7723 

/gene="UCP2" 

/f requency=" 0.07" 

/replace="a" 

7747 

/gene="UCP2 " 
/frequency="0.02" 
/replace="a" 
7794 

/gene="UCP2 " 

/ f r e que n c y = " 0 . 0 1 " 

/replace="a" 

7804 

/gene="UCP2 " 

/f requency="0 . 01" 

/replace="t" 

7952 



Query Match 98.1%; 
Best Local Similarity 99.8%; 
Matches 636; . Conservative 



Score 625; DB 5; Length 12177; 
Pred. No. 5.3e-166; 
0; Mismatches 0; Indels 1; 



Gaps 



1; 



Qy 



Db 



1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
1432 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 1491 



Qy 61 GAC AGCGAGGC C GGT C GCGAGGCC CC AG TCC C GC C C TGC AGGAGCC AGC C GC GCGC TCGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1492 GAC AGC GAGGC C GGT C GCGAGGCC CC AG TCC C GC C C TGC AGGAGCC AGC C GC GCGC TCGC 1551 

Qy 121 T CGC AGGAGGG T GGG TAGT T TGC C C AGC GT AGGGGGGC TGGGCC CATAAAAGAGGAAGTG 180 
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I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I 
Db 1552 TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTG 1611 

Qy 181 C AC T T AAGAC AC GGC CC CGC TGGAC GC T TGT T AGAAAC C GTC C T GGC TGGGAAGGC AAGA 240 

I I I I II I I I I I I I I II I I I I M I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1612 C AC T T AAGAC AC GGC CC C GC TGGAC GC - TGT TAGAAAC C GT CC T GGC TGGGAAGGCAAGA 1670 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1671 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 1730 

Qy 301 GCGGCCC GAGAGAGTGTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACCAT CTCGGGGA 360 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1731 GCGGCCC GAGAGAGTGTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGACCATC TCGGGGA 17 90 

Qy 361 AC GAAGGAGTAAACGC GGT GATGGGACGC ACGGAAAC GGGAGTGGAGAAAGT C ATGGAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1791 AC GAAGGAGTAAACGC GGTGATGGGACGCACGGAAACGGGAGTGGAGAAAGTCATGGAGA 1850 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1851 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 1910 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1911 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 1970 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1971 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 2030 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2031 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 2067 



RESULT 3 

AP003717/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



AP003717 156370 bp DNA linear PRI 27-APR-2002 

Homo sapiens genomic DNA, chromosome llq clone :RP11-167N4 , complete 
sequences . 
AP003717 

AP003717.3 GI: 20334343 
HTG. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo . 
1 

Hattori,M., Ishii,K., Toyoda,A., Taylor, T.D., Hong-Seog, P . , 
Fujiyama, A., Yada,T., Totoki,Y., Watanabe,H. and Sakaki,Y. 
Homo sapiens genomic DNA 
Published Only in Database (2001) 
2 (bases 1 to 156370) 

Hattori,M., Ishii,K., Toyoda,A., Taylor, T.D., Hong-Seog, P . , 
Fujiyama, A., Yada,T., Totoki,Y., Watanabe,H. and Sakaki,Y. 
Direct Submission 

Submitted (04-JUN-2001 ) Masahira Hattori, The Institute of Physical 
and Chemical Research (RIKEN) , Genomic Sciences Center (GSC) ; 
1-7-22 Suehiro-chou, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan 
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( E-mail : hattori@gsc . riken . go . j p, URL : http : //hgp . gsc . r iken . go . j p/ , 
161:81-45-503-9111, Fax:81-45-503-9170) 
COMMENT On Apr 26, 2002 this sequence version replaced gi: 16904692. 

FEATURES Location/Qualifiers 
source 1. .156370 

/organism="Homo sapiens" 
/mo l_type=" genomic DNA" 
/db_xref="taxon:9606" 
/ chr omo s ome= " 1 1 " 
/map="llq" 
/clone="RPll-167N4 " 

ORIGIN 

Query Match 98.1%; Score 625; DB 5; Length 156370; 

Best Local Similarity 99.8%; Pred. No. 6.9e-166; 

Matches 636; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 43495 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 43436 

Qy 61 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 120 

I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 3435 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 4337 6 

Qy 121 TCGCAGGAGGGTGGGTAGT TTGCC CAGC GTAGGGGGGC TGGGCCCATAAAAGAGGAAGTG 180 

I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 43375 TCGCAGGAGGGTGGGTAGT TTGCCCAGC GTAGGGGGGC TGGGCCCATAAAAGAGGAAGTG 43316 

Qy 181 C AC T TAAGAC ACGGC CC C GCT GGAC GC T T GT T AGAAAC C GTC C T GGC T GGGAAGGC AAGA 240 

I I I II I I II I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I II I I I I I I I II I I I I I I I 
Db 43315 CAC T TAAGAC ACGGC C C C GCT GGAC GC - T GT T AGAAACC GT CC T GGC TGGGAAGGC AAGA 43257 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 43256 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 43197 

Qy 301 GCGGC C C GAGAGAGT GTGAGGC AG AGGC GGGGAGTGGCAAGGGAGTGAC CAT C T C GGGGA 360 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43196 GCGGCCCGAGAGAGTGTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGAC CAT C TCGGGGA 43137 

Qy 361 AC GAAGGAGTAAACGC GGT GAT GGGAC GCAC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 420 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43136 AC GAAGGAGTAAACGC GGT GAT GGGAC GCAC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 43077 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 43076 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 43017 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 43016 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 42957 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I 
Db 42956 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 42897 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 42896 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 42860 
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RESULT 4 

AP003531/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



AP003531 199384 bp DNA linear PRI 27-APR-2002 

Homo sapiens genomic DNA, chromosome llq clone : RP11-535C12, 
complete sequences. 
AP003531 

AP003531.2 GI: 20334341 
HTG. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 

Hattori,M., Ishii,K., Toyoda,A., Taylor, T.D. 
Fujiyama, A., Yada,T., Totoki,Y., Watanabe,H. 
Homo sapiens genomic DNA 
Published Only in Database (2001) 
2 (bases 1 to 199384) 

Hattori,M., Ishii,K., Toyoda,A., Taylor, T.D. 
Fujiyama, A., Yada,T., Totoki,Y., Watanabe,H. 
Direct Submission 

Submitted (18-APR-2001) Masahira Hattori, The Institute of Physical 
and Chemical Research (RIKEN) , Genomic Sciences Center (GSC) ; 
1-7-22 Suehiro-chou, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan 
(E-mail : hattori@gsc . riken. go . jp, URL : http : //hgp . gsc . riken . go . jp/ , 
Tel: 81-45-503-9111, Fax:81-45-503-9170) 

On Apr 26, 2002 this sequence version replaced gi: 13699094. 
Location/Qualifiers 
1. .199384 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/chromosome="ll" 
/map="llq" 

/clone="RPll-535C12 " 



- Hong-Seog,P. 
and Sakaki,Y. 



. Hong-Seog, P . 
and Sakaki,Y. 



ORIGIN 



Query Match 98.1%; Score 625; DB 5; Length 199384; 

Best Local Similarity 99.8%; Pred. No. 7e-166; 

Matches 636; Conservative 0; Mismatches 0; Indels 1; 



Gaps 



1; 



Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I I I I I 1 
Db 181036 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 180977 

Qy 61 GAC AGC GAGGC C GGT CGC G AGGC C C C AGTCC C GC C C T GC AGGAGC C AGC CGC GC GC TC GC 120 

I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180976 GACAGC GAGGC CGGT CGC GAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 180917 



Qy 

Db 

Qy 

Db 

Qy 



121 TC GCAGGAGGGT GGGT AGT T TGC C C AGC GT AGGGGGGC TGGGC C CATAAAAGAGGAAGTG 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
180916 TCGC AGGAGGG T GGG TAGT T TGC C CAGC GT AGGGGGGC TGGGC C CATAAAAGAGGAAGTG 180857 

181 C AC T TAAGAC AC GGC C CCGC TGGACGC T TGT T AGAAACCGT CC T GGC T GGGAAGGC AAGA 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
180856 CAC TTAAGACACGGCCCCGCTGGACGC - TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGA 180798 



241 GGT GT GT GAC T GGAC AAGAC TTGTTTCT GGCGGTC AGTC T T GC C ATC C T C AC AGAGGT TG 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



300 
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Db 180797 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 180738 

Qy 301 GCGGCCC GAGAGAGT GTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGACCATC TCGGGGA 360 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I 
Db 180737 GCGGCCC GAGAGAGT GTGAGGCAGAGGC GGGGAGTGGCAAGGGAGTGACCAT C TCGGGGA 180678 

Qy 361 AC GAAGGAGTAAACGC GGT GATGGGAC GC ACGGAAAC GGGAGTGGAGAAAGT C ATGGAGA 420 

I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180677 AC GAAGGAGTAAACGCGGT GATGGGAC GC AC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 180618 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
Db 180617 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 180558 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 180557 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 180498 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 180497 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 180438 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I 
Db 180437 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 180401 



RESULT 5 
AF306570 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AF306570 3270 bp DNA 

Homo sapiens uncoupling protein 2 gene, 
nuclear gene for mitochondrial product. 
AF306570 

AF306570.1 GI: 11037742 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



linear PRI 30-OCT-2000 
promoter region and exon 1; 



Vertebrata; Euteleostomi; 



Primates; Catarrhini; 



Esterbauer, H . 
1 



and Patsch,W. 



Esterbauer , H . and Patsch,W. 



promoter 
mRNA 



exon 



Mammalia; Eutheria; Euarchontoglires ; 
Hominidae ; Homo . 

1 (bases 1 to 3270) 
Schneitler, C . , Oberkof ler , H . , 
UCP2 promoter region and exon 
Unpublished 

2 (bases 1 to 3270) 
Schneitler, C . , Oberkof ler, H . 
Direct Submission 

Submitted ( 18-SEP-2000) Laboratory Medicine, Landeskliniken 
Salzburg, Muellner Hauptstr. 48, Salzburg A-5020, Austria 

Location/Qualifiers 

1. .3270 

/organism="Homo sapiens" 
/mo l_type=" genomic DNA" 
/db_xref="taxon: 9606" 
/ chr omo s ome ="11" 
/map="llql3" 
1. .3117 
3118. .>3241 

/product="uncoupling protein 2" 
/note="UCP2" 
3118. .3241 
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/ n umber =1 



ORIGIN 



Query Match 97.9%; 
Best Local Similarity 99.7%; 
Matches 635; Conservative 



Score 62 3.4; DB 5; 
Pred. No. 1.3e-165; 
0; Mismatches 1; 



Length 3270; 



Indels 



1; Gaps 



1; 



Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2549 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 2608 



Qy 

Db 

Qy 

Db 

Qy 

Db 



61 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2609 GAC AGC GAGGC C GGT CGC GAGGC C C CAG TC C C GC C C TGC AGGAGC C AGC CGC GCGC TCGC 2668 



121 



180 



TCGCAGGAGGGTGGGTAGT T TGCCCAGC GTAGGGGGGC TGGGCC CATAAAAGAGGAAGTG 
I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2669 TCGCAGG AGGGT GGGTAGT T TGC C CAGC GTAGGGGGGC T GGGC C CATAAAAGAGGAAGTG 2728 



181 



240 



C AC T TAAGAC AC GGC CCC GC TGGACGC T T GT T AGAAAC C GTCC T GGC T GGGAAGGC AAGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
2729 C AC T TAAGAC AC GGC CCC GC TGGACGC - T GT TAGAAAC C GT CC TGGC T GGGAAGGCAAGA 2787 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2788 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 2847 



301 



360 



GCGGCCC GAGAGAGTGTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACC AT CTCGGGGA 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
2848 GCGGCCC GAGAGAGT GTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACC AT CTCGGGGA 2907 



361 



420 



ACGAAGGAGTAAACGCGGT GAT GGGACGC AC GGAAACGGGAGT GGAGAAAGT C ATGGAGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2908 ACGAAGGAGTAAACGCGGTGATGGGACGCACGGAAACGGGAGTGGAGAAAGTCATGGAGA 2967 



421 



480 



GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
2968 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 3027 



481 



540 



TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 
I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
3028 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCTCCCCCGAGGC 3087 



541 



600 



TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3088 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 3147 



601 



637 



GAT TGACTGTCCACGC TCGC CC GGC TCGTCCGACGCG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3148 GAT TGACTGTCCACGC TCGC CCGGC TCGTCCGACGCG 3184 



RESULT 6 
AC019121 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



AC019121 197031 bp DNA linear HTG 07-JUL-2000 

Homo sapiens chromosome 11 clone RP11-535C12, WORKING DRAFT 
SEQUENCE, 23 unordered pieces. 
AC019121 

AC019121.3 GI:8440022 

HTG; HTGS_PHASE1; HTGS_DRAFT. 

Homo sapiens (human) 
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Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Catarrhini; 
Hominidae; Homo. 

1 (bases 1 to 197031) 
Waterston, R.H . 

The sequence of Homo sapiens clone 
Unpublished 

2 (bases 1 to 197031) 
Waterston, R.H. 
Direct Submission 

Submitted (30-DEC-1999) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

On Jun 10, 2000 this sequence version replaced gi: 7105573. 

Genome Center 

Center: Washington University Genome Sequencing Center 
Center code: WUGSC 

Web site:http : //genome .wustl . edu/gsc/index . shtml 

Project Information 

Center project name: H_NH0535C12 

Summary Statistics 

Sequencing vector: M13; 55% 
Sequencing vector: plasmid; 45% 
Chemistry: Dye-primer ET; 55% of reads 
Chemistry: Dye-terminator Big Dye; 45% of reads 
Assembly program: Phrap; version 0.990319 
Consensus quality: 182418 bases at least Q40 
Consensus quality: 187565 bases at least Q30 
Consensus quality: 190012 bases at least Q20 
Insert size: 190000; agarose-fp 
Insert size: 194831; sum-of-contigs 
Quality coverage: 4.10 in Q20 bases; agarose-fp 
Quality coverage: 4.05 in Q20 bases; sum-of-contigs 



* NOTE: This is a "working draft 1 sequence. It currently 

* consists of 23 contigs . The true order of the pieces 

* is not known and their order in this sequence record is 

* arbitrary. Gaps between the contigs are represented as 

* runs of N, but the exact sizes of the gaps are unknown. 

* This record will be updated with the finished sequence 

* as soon as it is available and the accession number will 

* be preserved. 



* 


1 


2971: 


contig 


of 2971 


bp in length 


* 


2972 


3071: 


gap of 


unknown 


length 


★ 


3072 


5764: 


contig 


of 2693 


bp in length 


★ 


5765 


5864: 


gap of 


unknown 


length 




5865 


8568: 


contig 


of 2704 


bp in length 


★ 


8569 


8668: 


gap of 


unknown 


length 


* 


8669 


12865: 


contig 


of 4197 


bp in length 


* 


12866 


12965: 


gap of 


unknown 


length 


★ 


12966 


18581: 


contig 


of 5616 


bp in length 


* 


18582 


18681: 


gap of 


unknown 


length 


+ 


18682 


23851: 


contig 


of 5170 


bp in length 


★ 


23852 


23951: 


gap of 


unknown 


length 


★ 


23952 


28414 : 


contig 


of 4463 


bp in length 




28415 


28514 : 


gap of 


unknown 


length 


★ 


28515 


33195: 


contig 


of 4681 


bp in length 


★ 


33196 


33295: 


gap of 


unknown 


length 


★ 


33296 


38648: 


contig 


of 5353 


bp in length 
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ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



ov^valvi^ ocoioii jvcauita lycuuio iui -rvpyiiL/auuii anu ocaiwn xvcauii uru7-ou7... ra^c i*t ui ^ 



gap of unknown length 
contig of 6177 bp in length 
gap of unknown length 
contig of 6759 bp in length- 
gap of unknown length 
contig of 6971 bp in length 
gap of unknown length 
contig of 9334 bp in length 
gap of unknown length 
contig of 8734 bp in length 
gap of unknown length 
contig of 10069 bp in length 
gap of unknown length 
contig of 8637 bp in length 
gap of unknown length 
contig of 8662 bp in length 
gap of unknown length 
contig of 12021 bp in length 
gap of unknown length 
contig of 14356 bp in length 
gap of unknown length 
contig of 11525 bp in length 
gap of unknown length 
contig of 112 68 bp in length 
gap of unknown length 
contig of 19341 bp in length 
gap of unknown length 
contig of 23129 bp in length. 
FEATURES Location/Qualifiers 
source 1- .197031 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/ chr omo s ome= "11" 
/clone="RPll-535C12" 
misc_f eature 1. .2971 

/note="assembly_name : Contig2 6" 
gap 2972. .3071 

/estimated_length==unknown 
misc_feature 3072. .5764 

/note="assembly_name : Contig27 " 
gap 5765. .5864 

/est ima ted_length=un known 
misc_f eature 5865. .8568 

/note="assembly_name:Contig28 " 
gap 8569. .8668 

/ e s t ima t ed_l eng th-unknown 
misc_f eature 8669. .12865 

/note="assembly_name : Contig2 9 " 
gap 12866. .12965 

/ estimated_length-unknown 
misc_feature 12966. .18581 

/note="assembly_name : Contig30" 
gap 18582. .18681 

/estimated_length=unknown 
misc_f eature 18682. .23851 

/note="assembly_name : Contig31 " 
gap 23852. .23951 

/est ima t ed_l eng th=unknown 
misc_f eature 23952. .28414 

/note="assembly_name : Contig32 " 



★ 


38649 


38748 


★ 


38749 


44925 


★ 


44926 


45025 


★ 


45026 


51784 




51785 


51884 




51885 


58855 

u ^ t^j 


★ 


58856 


58955 


★ 


58956 


68289 


★ 


68290 


68389 


★ 


68390 


77123 




77124 


77223 


+ 


77224 


87292 


★ 


87293 


87392 


* 


87393 


96029 


★ 


96030 


96129 


* 


96130 


1047 91 


★ 


104792 


1048 91 


★ 


104892 


116912 




116913 


117012 




117013 


131368 


* 


131369 


131468 


★ 




142993 


★ 


1 47994 


143093 


* 






★ 


154362 


154461 


★ 


154462 


173802 




173803 


173902 


★ 


173903 


197031 
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gap 

misc_feature 
gap 

misc_feature 
gap 

misc_f eature 
gap 

misc_f eature 
gap 

misc_f eature 
gap 

mi sc_f eature 
gap 

misc_feature 
gap 

misc_feature 
gap 

misc_feature 
gap 

misc_f eature 
gap 

misc_f eature 
gap 

mi sc_f eature 
gap 

misc_f eature 
gap 

misc_feature 
gap 

misc feature 



28415. .28514 

/est ima t ed_l eng th=unkno wn 

28515. .33195 

/note="assembly_name: Contig33" 
33196. .33295 
/ e s t ima t ed_l eng th=unkno wn 
33296. .38648 

/note="assembly_name : Contig34 " 
38649. .38748 
/est ima t ed_l eng th=unkno wn 
38749. .44925 

/note=" assembly_name : Contig35 " 
44926. .45025 
/estimated_length=unknown 
45026. .51784 

/note="assembly_name : Contig36" 
51785. .51884 
/ e s t ima t ed_l eng th=unkno wn 
51885. .58855 

/note= " a ssembly_name : Cont ig3 7 " 
58856. .58955 
/ e s t ima t ed_l eng th=unkno wn 
58956. .68289 

/note= " a ssembly_name : Contig3 8 " 
68290. .68389 
/ e s t ima t ed_l eng th=unkno wn 
68390. .77123 

/note="assembly_name : Contig3 9 " 
77124. .77223 
/est ima t ed_l eng th=unknown 
77224. .87292 

/note-"assembly_name : Contig40" 
87293. .87392 
/estimated_length=unknown 
87393. .96029 

/note=="assembly_name : Contig41 " 
96030. .96129 
/ e s t ima t ed_l eng th=unknown 
96130. .104791 

/note="assembly_name : Contig42 " 
104792. .104891 
/ es t ima t ed_l ength=unknown 
104892. .116912 

/note="assembly_name: Contig43" 
116913. .117012 
/ estimated_length-unknown 
117013. .131368 

/note="assembly_name :Contig44 " 
131369. .131468 
/estimated_length=unknown 
131469. .142993 

/note="assembly_name:Contig45" 

142994. .143093 

/est ima t ed_l eng th=unknown 

143094. .154361 

/note="assembly_name:Contig4 6 

clone_end:T7 

vector_side : left" 

154362. .154461 

/est ima t ed_l eng th=unkno wn 

154462. .173802 
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gap 

misc feature 



/note="assembly_name : Contig47 " 
173803. .173902 
/estimated_length==unknown 
173903. .197031 
/note="assembly_name : Contig48 
clone_end:SP6 
vector_side : right" 



ORIGIN 



Query Match 94.2%; Score 599.8; DB 12; Length 197031; 

Best Local Similarity 98.4%; Pred. No. le-158; 

Matches 627; Conservative 0; Mismatches 7; Indels 3; Gaps 



2; 



Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 164736 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 164795 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 GAC AGCGAGGCC GGT C GC GAGGCC C C AG T C CC GC C C TGC AGGAGCC AGC CGC GC GC TCGC 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
164796 GAC AGCGAGGC C GGT C GC GAGGCC C CAGT C CC GC C C TGCAGGAGCC AGC CGC GC GC TCGC 164855 



121 



180 



TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCATAAAAGAGGAAGTG 
I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
164856 TCGCAGGAGGGTGGGTAGTTTGCCCAGCGTA-GGGGGCTGGGCCC ATAAAAGAGGAAGTG 164914 



181 



240 



C AC T TAAGAC ACGGC C CC GC TGGAC GC T T GT T AGAAAC C GT CC T GGC TGGGAAGGC AAGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

164915 CACTTAAGACACGGCCCAG — T GGAC GC T GT TAGAAAC C GT C C T GGC TGGGAAGGCAAG A 164972 



241 



300 



GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
164973 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 165032 



301 



360 



GCGGCC C GAGAGAGTGTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGAC CAT C TC GGGGA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I II I I I I 
165033 GCGGCCC GAGAGAGT GTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGAC CAT CTCGGGGA 165092 



361 



420 



ACGAAGG AGTAAACGC GGT GATGGGACGC AC GGAAAC GGGAGTGGAGAAAGT C ATGGAGA 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I 
165093 ACGAAGGAGTAAAC GC GGT GATGGGACGC ACGGAAAC GGGAGTGGAGAAAGT C ATGGAGA 165152 



421 



480 



GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
165153 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 165212 



481 



540 



TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
165213 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 165272 



541 



600 



TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
165273 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 165332 



Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II 
Db 165333 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 165369 



RESULT 7 
AC024029/C 

LOCUS AC024029 155668 bp DNA linear HTG 07-JUL-2000 
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DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Homo sapiens chromosome 11 clone RP11-167N4, WORKING DRAFT 

SEQUENCE, 15 unordered pieces. 

AC024029 

AC024029.3 GI:7230916 

HTG; HTGS_PHASE1; HTGS_DRAFT . 

Homo sapiens (human) 

Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebra ta; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 

1 (bases 1 to 155668) 
Waterston, R.H . 

The sequence of Homo sapiens clone 
Unpublished 

2 (bases 1 to 155668) 
Waterston, R.H. 
Direct Submission 

Submitted (20-FEB-2000) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

On Mar 13, 2000 this sequence version replaced gi: 7109555. 



Genome Center 

Center: Washington University Genome Sequencing Center 
Center code: WUGSC 

Web site : http : //genome . wustl . edu/gsc/index . shtml 

Project Information 

Center project name: H_NH0167N04 

Summary Statistics 

Sequencing vector: M13; 100% 
Sequencing vector: plasmid; 0% 
Chemistry: Dye-primer ET; 100% of reads 
Chemistry: Dye-terminator Big Dye; 0% of reads 
Assembly program: Phrap; version 0.990319 
Consensus quality: 14 6450 bases at least Q40 
Consensus quality: 149629 bases at least Q30 
Consensus quality: 151087 bases at least Q20 
Insert size: 168000; agarose-fp 
Insert size: 154268; sum-of-contigs 
Quality coverage: 3.98 in Q20 bases; agarose-fp 
Quality coverage: 4.38 in Q20 bases; sum-of-contigs 

* NOTE: This is a 'working draft 1 sequence. It currently 

* consists of 15 contigs. The true order of the pieces 

* is not known and their order in this sequence record is 

* arbitrary. Gaps between the contigs are represented as 

* runs of N, but the exact sizes of the gaps are unknown. 

* This record will be updated with the finished sequence 

* as soon as it is available and the accession number will 

* be preserved. 

* 1 1806: contig of 1806 bp in length 

* 1807 1906: gap of unknown length 

* 1907 4798: contig of 2892 bp in length 

* 4799 4898: gap of unknown length 

* 4899 7312: contig of 2414 bp in length 

* 7313 7412: gap of unknown length 

* 7413 11277: contig of 3865 bp in length 

* 11278 11377: gap of unknown length 

* 11378 14368: contig of 2991 bp in length 

* 14369 14468: gap of unknown length 

* 14469 20130: contig of 5662 bp in length 



http://es/ScoreAccessWeb/Gefltem.action?AppId=09869098&seqId=532378&ItemName=^... 6/7/2006 



FEATURES 
sour 



ce 



* 


20131 


20230: 


gap of 


unknown length 




20231 


25513: 


contig 


of 5283 bp in length 


★ 


25514 


25613: 


gap of 


unknown length 


* 


25614 


30765: 


contig 


of 5152 bp in length 


★ 


30766 


30865: 


gap of 


unknown length 


★ 


30866 


37337: 


contig 


of 6472 bp in length 


★ 


37338 


37437: 


gap of 


unknown length 




37438 


45571: 


contig 


of 8134 bp in length 


★ 


45572 


45671: 


gap of 


unknown length 


* 


45672 


60199: 


contig 


of 14528 bp in length 


★ 


60200 


60299: 


crap of 


unknown length 


★ 


60300 


71424: 


contig 


of 11125 bp in length 


* 


71425 


71524 : 


qap of 


unknown length 


* 


71525 


86218: 


contig 


of 14694 bp in length 


★ 


86219 


86318: 


gap of 


unknown length 


* 


86319 


104104: 


contig 


of 177 8 6 bp in length 


* 


104105 


104204: 


gap of 


unknown length 


* 


104205 


155668: 


contig 


of 514 64 bp in length. 




Location/Qualifiers 




1. 


.155668 







misc_f eature 
gap 

mi sc_f eature 
gap 

misc_f eature 
gap 

mi sc_f eature 
gap 

misc feature 



gap 

misc_ 

gap 

misc_ 

gap 

misc_ 

gap 

misc 



feature 



feature 



feature 



feature 



/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9606" 
/ chr onto s ome= "11" 
/clone="RPll-167N4 " 
1. .1806 

/note="assembly_name : Contigl3 " 
1807. .1906 

/ es timated_l ength=un known 
1907. .4798 

/note="assembly_name : Contigl4 " 
4799. .4898 

/ estimated_length=un known 
4899. .7312 

/note="assembly__name : ContiglS" 
7313. .7412 

/ e s t ima t e d_l e ng th=un known 
7413. .11277 

/note="assembly_name : Contigl6" 
11278. .11377 
/estimated_length=unknown 
11378. .14368 

/note="assembly_name : Contigl7 
clone_end : S P 6 
vector_side : left" 
14369. .14468 
/estimated_length=unknown 
14469. .20130 

/note="assembly_name : Contigl8 " 
20131. .20230 
/estimated_length=unknown 
20231. .25513 

/note="assembly_name:Contigl9" 
25514. .25613 
/estimated_length=unknown 
25614. .30765 

/note="assembly_name:Contig20" 
30766. .30865 
/estimated_length=unknown 
30866. .37337 

/note="assembly_name : Contig21 " 
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gap 37338. .37437 

/estimated_length=unknown 
misc_feature 37438. .45571 

/note="assembly_name : Contig22 " 
gap 45572. .45671 

/estimated_length=unknown 
misc_feature 45672. .60199 

/note="assembly__name : Contig23 " 
gap 60200. .60299 

/ e s t ima t ed_l ength=un known 
misc_feature 60300. .71424 

/note="assembly_name :Contig24 " 
gap 71425. .71524 

/es t ima ted_length=un known 
misc_feature 71525. .86218 

/note="assembly_name : Contig25 

clone_end:T7 

vector_side : right" 
gap 86219. .86318 

/ e s t ima t ed_l eng th=unknown 
misc_feature 86319. .104104 

/note="assembly_name :Contig26" 
gap 104105. .104204 

/estimated__length=unknown 
misc_feature 104205. .155668 

/note="assembly_name : Contig27 " 

ORIGIN 

Query Match 94.1%; Score 599.4; DB 12; Length 155668; 

Best Local Similarity 99.4%; Pred. No. 1.3e-158; 

Matches 633; Conservative 0; Mismatches 1; Indels 3; Gaps 3; 

Qy 1 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 60 

I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68782 ACCTGTTGCTGCAGTTCTGATTGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATC 6872 3 

Qy 61 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 68722 GACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGCTCGC 68 663 

Qy 121 TCGCAGGAGGGTGGGTAGT TTGCC C AGCGTAGGGGGGC TGGGCC CATAAAAGAGGAAGTG 180 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68662 TCGCAGGAGGGTGGGTAGT TTGCCCAGCGTAGGGGGGC TGGGCC CATAAAAGAGGAAGTG 68603 

Qy 181 CAC TTAAGACACGGC CCCGCTGGACGC T TGTT AGAAACCGTCCTGGC TGGGAAGGC AAGA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I 

Db 68602 C AC TTAAGAC AC GGC C CCGC TGGACGC - TGT T AGAAAC CGT C C TGGC TGGGAAGGC AAGA 68544 

Qy 241 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 300 

II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 68543 GGTGTGTGACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTG 68484 

Qy 301 GCGGCC C GAGAGAGT GTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGAC CAT C T CGGGGA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68483 GCGGCCCGAGAGAGTGTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGAC CAT CTCGGGGA 68424 

Qy 361 AC GAAGGAGTAAAC GC GGT GATGGGAC GC AC GGAAACGGGAGTGGAGAAAGT C ATGGAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68423 AC GAAGGAGTAAAC GC GGT GATGGGAC GC AC GGAAACGGGAGTGGAGAAAGT C ATGGAGA 68364 

Qy 421 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 480 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68363 GAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAG 68304 

Qy 481 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGC 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68303 TAGGAGCTGGCAGGCCCGGCCCCGCCCCGCATGCCCCACCCCGGGCCCCGCCCCCGAGGC 68244 

Qy 541 TTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 68243 TTAAG-CGCGCCGCCGCCTGCGC-GAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGG 68186 

Qy 601 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 68185 GATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 68149 



RESULT 8 
AK025742 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



COMMENT 



AK025742 2131 bp mRNA linear PRI 13-SEP-2003 

Homo sapiens cDNA: FLJ22089 fis, clone HEP16080, highly similar to 
HSU94592 Human uncoupling protein homolog (UCPH) mRNA. 
AK025742 

AK025742.1 GI: 10438354 

oligo capping; fis (full insert sequence). 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 



FEATURES 

source 



Kobatake,N., Inagaki,H., Ikema,Y., 
Ota,T., Suzuki, Y., Obayashi,M., Nishi,T. 
r Nakamura,Y., Isogai,T. and Sugano,S. 



Nishi,T., Isogai,T., 



Kawabata,A., Hikiji,T., 
Okamoto,S., Okitani,R., 
Shibahara, T . , Tanaka,T. 
NEDO human cDNA sequencing project 
Unpublished 
2 (bases 1 to 2131) 

Sugano,S., Suzuki, Y., Ota,T., Obayashi,M. 
Shibahara, T. , Tanaka,T. and Nakamura,Y. 
Direct Submission 

Submitted (29-AUG-2000) Sumio Sugano, Institute of Medical Science, 
University of Tokyo, Laboratory of Genome Structure Analysis, Human 
Genome Center; Shirokane-dai, 4-6-1, Minato-ku, Tokyo 108-8639, 
Japan (E-mail: f lcdna@ims .u-tokyo . ac . jp, Tel : 81-3-5449-5286, 
Fax:81-3-5449-5416) 

NEDO human cDNA sequencing project supported by Ministry of 
International Trade and Industry of Japan; cDNA full insert 
sequencing: Research Association for Biotechnology; cDNA library 
construction, 5 r - & 3 '-end one pass sequencing: Departent of 
Virology and Human Genome Center, Institute of Medical Science, 
University of Tokyo (partly supported by Science and Technology 
Agency) . 

Location/Qualifiers 
1. .2131 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="HEP16080" 
/cell_line="HepG2 " 
/cell_type="hepatoma " 
/clone_lib="HEP" 

/note^"cloning vector pME18SFL3" 



http://es/ScoreAccessWeb/G^ 6/7/2006 



o\^valvi-> ocoii/u rvcauna xscioii:> iui rtpjjiivauun U70U7U70 cuiu ocai^n xvcauit us-v7-ou7... xagc i-i ui z.^ 



misc_feature 1. .2131 

/note="highly similar to HSU94592 Human uncoupling protein 
homolog (UCPH) mRNA" 

ORIGIN 

Query Match 81.5%; Score 519; DB 5; Length 2131; 

Best Local Similarity 99.8%; Pred. No. 6e-136; 

Matches 530; Conservative 0; Mismatches 0; Indels 1; Gaps 1; 

Qy 107 AGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCA 166 

I I I I I I I I 1 I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 AGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCCA 60 

Qy 167 TAAAAGAGGAAGTGC ACT TAAGAC ACGGCCCC GCTGGACGCT TGT T AGAAAC CGTC C TGG 226 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 61 TAAAAGAGGAAGTGCACTTAAGACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGG 119 

Qy 227 C TGGGAAGGC AAGAGGTGT GTGAC T GGAC AAGAC T T GT T T C T GGCGGT C AGT C T TGC C AT 286 

I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 120 C TGGGAAGGC AAGAGGTGT GTGAC T GGAC AAGAC T T GT T T C T GGCGGT C AGT C T TGC CAT 179 

Qy 287 C C T CACAGAGG T TGGCGGC C C GAG AGAGT GTGAGGC AGAGGC GGGGAG TGGC AAGGGAGT 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 CC T CACAGAGG T TGGCGGC C C GAGAGAG T GTGAGGC AGAGGC GGGGAGTGGC AAGGGAGT 239 

Qy 347 GAC CATC TCGGGGAACGAAGGAGT AAAC GCGG TGAT GGGACGCACGGAAAC GGGAG TGGA 406 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GAC CATC TCGGGGAACGAAGGAGT AAAC GCGG TGAT GGGACGC AC GGAAAC GGGAGTGGA 299 

Qy 407 GAAAGTCATGGAGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGG 4 66 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I M I t I I I I 
Db 300 GAAAGTCATGGAGAGAACCCTAGGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGG 359 

Qy 4 67 TCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGC 526 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 360 TCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGC 419 

Qy 527 CCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGC 586 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I I I I M I I I I I I I I I II I I I I I 
Db 420 CCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGC 479 

Qy 587 TGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 480 TGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGACGCG 530 



RESULT 9 
AR039137 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 



AR039137 736 bp DNA 

Sequence 1 from patent US 5807740. 
AR039137 

AR039137.1 GI: 5958500 

Unknown . 
Unknown . 
Unclassified. 
1 (bases 1 to 736) 
Amaral, M.Catherine, and Chen, J. -L. 
Regulators of UCP2 gene expression 
Patent: US 5807740-A 1 15-SEP-1998; 
Location/Qualifiers 



linear 



PAT 29-SEP-1999 
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source 



1. .736 

/organism="unknown" 
/mol_type="unassigned DNA" 



ORIGIN 



Query Match 76.1%; 
Best Local Similarity 99.6%; 
Matches 507; Conservative 



Score 485; DB 2; Length 73 6; 
Pred. No. 2.5e-126; 
0; Mismatches 0; Indels 



2 ; Gaps 



2; 



Qy 



Db 



129 GGGTGGGTAGT TTGC CCAGCGTAGGGGGGCTGGGCC CATAAAAGAGGAAGTGCAC T TAAG 188 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 GGGTGGGTAGT TTGC CCAGCGTAGGGGGGCTGGGCC CATAAAAGAGGAAGTGCAC T TAAG 60 



Qy 



Db 



18 9 ACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 248 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 



Qy 



Db 



24 9 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 



Qy 



Db 



309 AGAGAGT GTGAGGC AGAGGC GGGGAGT GGCAAGGGAGTGAC CAT C TCGGGGAAC GAAGGA 368 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
180 AGAGAGTGTGAGGCAGAGGCGGGGAGTGGCAAGGGAGTGACCAT C TCGGGGAACGAAGGA 239 



Qy 



Db 



369 GTAAACGCGGT GATGGGAC GC ACGGAAACGGGAGTGGAGAAAGT C ATGGAGAGAAC C C T A 428 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
240 GTAAACGCGGT GAT GGGACGC ACGGAAACGGGAGTGGAGAAAGT CATGGAGAGAACCC TA 299 



Qy 



Db 



429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I 1 I I I I I I I I I I I I I I I 
300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 



Qy 



Db 



489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I I II I I I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 



Qy 



Db 



54 9 C GC CGCC GCC T GC GC GGAGC C C C AC TGC GAAGC CC AGC T GCGC GCGCC T T GGGAT T GAC T 608 

I I I I I I I II I I II I II I I I I I I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I II I II I I 
419 CGC CGCC GCC T GCGC GGAGCCCCAC TGC GAAGC CCAGCTGCGCGC GCC TTGGGATT GAC T 478 



Qy 



Db 



609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
479 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 10 

AR065267 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 



AR065267 
Sequence 
AR065267 
AR065267. 



1 from patent 



736 bp 
US 5849514, 



DNA 



linear 



PAT 29-SEP-1999 



GH5995483 



Unknown . 

Unknown. 

Unclassified. 

1 (bases 1 to 736) 

Amaral, M.Catherine, and Chen, J. -L. 

Method of identifying agents that modulate UCP2 promoter activity 
Patent: US 5849514-A 1 15-DEC-1998; 
Location/Qualifiers 
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source 



1. .736 

/ o r g ani sm= " unknown " 
/mol_type="unassigned DNA" 



ORIGIN 



Query Match 76.1%; 
Best Local Similarity 99.6%; 
Matches 507; Conservative 



Score 48 5; DB 2; Length 736; 
Pred. No. 2.5e-126; 
0; Mismatches 0; Indels 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



2 ; Gaps 2 ; 

129 GGGT GGG TAGT T TGC C C AGCGT AGGGGGGC T GGGC C C ATAAAAGAGGAAGTGC AC T TAAG 188 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

1 GGGTGGGTAGTTTGCCCAGCGTAGGGGGGCTGGGCCC ATAAAAGAGGAAGTGC ACT TAAG 60 

189 ACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 248 

II I I I I I I I II I II I I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 

249 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 

309 AGAGAGTGTGAGGC AGAGGC GGGGAGT GGCAAGGGAGTGAC CAT C TC GGGGAAC GAAGGA 368 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
180 AGAGAGT GTGAGGC AGAGGC GGGGAGTGGCAAGGGAGTGACC AT C TCGGGGAAC GAAGGA 239 

369 GTAAACGCGGTGAT GGGAC GCACGGAAACGGGAGTGGAGAAAGT CATGGAGAGAAC C C TA 428 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
240 GTAAACGCGGT GATGGGAC GCAC GGAAAC GGGAGTGGAGAAAGTC AT GGAGAGAAC CC TA 299 

429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

549 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 

609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I! I I I I I I I I I I ! I I I I I I I I I I I 
479 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 11 

BD061651 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



BD061651 736 bp DNA 

Regulators of UCP2 gene expression. 
BD061651 

BD061651.1 GI: 22607256 
JP 2001507943-A/l. 
synthetic construct 
synthetic construct 

other sequences; artificial sequences. 
1 (bases 1 to 736) 
Amaral,C.M. and Chen, J. L. 
Regulators of UCP2 gene expression 
Patent: JP 2001507943-A 1 19-JUN-2001; 
TULARIK INC 



linear 



PAT 27-AUG-2002 
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COMMENT PN JP 2001507943-A/l 

PD 19- JUN-2001 
PF 22-APR-1998 JP 1998547120 
PR 25-APR-1997 US 08/846012 
PI CATHERINE M AMARAL, JIN LONG CHEN 

PC C12Nl/00,C12N5/10,C12N15/ll,C12N15/63,C12Ql/02,C12Ql/68 CC 
Strandedness : Double; 
CC Topology: Linear; 

FH Key Location/Qualifiers. 
FEATURES Location/Qualifiers 
source 1. .736 

/organism=" synthetic construct" 
/mol_type=" genomic DNA" 
/db_xref-"taxon: 32630" 

ORIGIN 

Query Match 76.1%; Score 485; DB 2; Length 736; 

Best Local Similarity 99.6%; Pred. No. 2.5e-126; 

Matches 507; Conservative 0; Mismatches 0; Indels 2; Gaps 2; 

Qy 129 GGGTGGG TAGT T T GC C C AGC GT AGGGGGGC T GGGC C C ATAAAAGAGGAAGT GC AC T TAAG 188 

I I I I I I 1 I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I 
Db 1 GGGTGGG TAGT T TGC C C AGC GT AGGGGGGC T GGGC C C ATAAAAGAGGAAGT GC AC T TAAG 60 

Qy 189 ACACGGCCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 248 * 

I I I I I I I I I I I I I I I I I I I I I i t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ACACGGCCCCGCTGGACGC-TGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTG 119 

Qy 249 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 308 

I I II I I I I I I I I I I I I I I I I I I I I I I I II I I II I I II I I I I I I I I I I I I I I I II I I I I I I 
Db 120 ACTGGACAAGACTTGTTTCTGGCGGTCAGTCTTGCCATCCTCACAGAGGTTGGCGGCCCG 179 

Qy 309 AGAGAGT GT GAGGCAGAGGCGGGG AGTGGCAAGGGAGT GACC ATC TCGGGGAAC GAAGGA 368 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 AGAGAGT GTGAGGCAGAGGCGGGG AGTGGCAAGGGAGT GAC CATC TCGGGGAAC GAAGGA 239 

Qy 369 GTAAACGCGGT GATGGGAC GC AC GGAAACGGGAGTGGAGAAAGTC ATGGAGAGAAC CC T A 428 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 240 GTAAACGCGGT GATGGGAC GC AC GGAAACGGGAGTGGAGAAAGTCATGGAGAGAAC CC T A 299 

Qy 429 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCT 488 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 300 GGCGGGGCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAG-T 358 

Qy 489 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 548 

II I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 359 GGCAGGCCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCG 418 

Qy 549 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 608 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I II 
Db 419 CGCCGCCGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACT 478 

Qy 609 GTCCACGCTCGCCCGGCTCGTCCGACGCG 637 

I I I I I I I I I I I I I I I I I II I II I I I I I I I 
Db 479 GTCCACGCTCGCCCGGCTCGTCCGACGCG 507 



RESULT 12 
AF208500 

LOCUS AF208500 3301 bp DNA linear PRI 09-JAN-2000 

DEFINITION Homo sapiens uncoupling protein 2 (UCP2) gene, promoter and exon 1. 
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o^/wxvu ocaii/ii ivcauiLa i^cuuid iui rvypn^auuii \jyo\j?\jyv auu ocai^n avcsuil ua-v7-ou7... a'^c ui 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AF208500 
AF208500.1 



GI:6684000 



gene 



misc feature 



mRNA 



exon 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 

1 (bases 1 to 3301) 

Tu,N., Chen,H., Winnikes,U., Reinert, I., Marmann,G., Pirke,K.M. and 
Lentes, K.U. 

Molecular cloning and functional characterization of the promoter 
region of the human uncoupling protein-2 gene 
Biochem. Biophys . Res. Commun. 265 (2), 326-334 (1999) 
10558866 

2 (bases 1 to 3301) 

Tu,N., Chen, H., Winnikes,U., Reinert, I., Pirke,K.M. and 
Lentes, K. -U. 

Functional characterization of the 5 1 -flanking and promoter regions 
of the human UCP3 gene 

Biochem. Biophys. Res. Commun. (2000) In press 

3 (bases 1 to 3301) 

Lentes, K. -U. , Tu,N. and Chen,H. 
Direct Submission 

Submitted (26-NOV-1999 ) Laboratory of Molecular Neurogenetics, 
Center for Psychobiological and Psychosomatic Research, University 
of Trier, Friedrich-Wilhelm-Strasse 23, Trier D-54290, Germany 

Location/Qualifiers 

1. .3301 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref ="taxon: 9606" 
/ chr omo s ome= " 1 1 " 
/map="llql3" 
1. .>3301 
/gene="UCP2" 
1. .3297 
/gene="UCP2" 

/note="5 ! flanking region and promoter" 
3298. .>3301 
/gene="UCP2 " 

/product="uncoupling protein 2" 
3298. .>3301 
/gene="UCP2 " 
/number =1 



ORIGIN 



Query Match 7 5.6%; 

Best Local Similarity 93.2%; 
Matches 538; Conservative 



Score 481.4; DB 5; 
Pred. No. 3.1e-125; 
0; Mismatches 31; 



Length 3301; 
Indels 8; 



Gaps 



3; 



Qy 22 TGGTTCCTTCCCCCGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAG 81 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I III I I I I I I I I 
Db 2726 TTGTTCCTTCCCCCGACAACCCGCGGCTTTAACCCAATTGACAGCGA-GCCCGTCGCGAG 2784 



Qy 

Db 

Qy 



82 GCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCG CGCTCGCTCGCAGGAGGGTGGGT 136 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I 

2785 GCCCCAGTCCCGCCCTGCAGGAGCCAGCCGCGCGGCTCGCTCGCAAGGAGGGTTGGTTAG 2844 

137 AGT T TGC CC AGCGT — AGGGGGGC T GGGC CCAT AAAAGAGGAAGT GC AC T TAAG ACAC GG 194 
II I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
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o^wivu, ocoiL/ii xvcauita lycuuid iui rvppnvauun w7ou7U7o anu ocaicn ivcauit uawou^... ^ u ui 



Db 2845 T T T TGC C C C AGC GT T AGGGGGGC T GGGC CC C ATAAAAGAGGAAG TGC AC T TAAGAC AC GG 2904 

Qy 195 CCCCGCTGGACGCTTGTTAGAAACCGTCCTGGCTGGGAAGGCAAGAGGTGTGTGACTGGA 254 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 2905 CC C CGC T GGAC GC T T GT T AGAAAC CGT C C T GGC TGGGAAGGC AAGAGG TGTG T GAC TGGA 2964 

Qy 255 CAAGAC TTGTTTCTGGC GG T C AGT C T T GCC AT C C TC AC AGAGGT T GGC GGCC C GAGAGAG 314 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2965 CAAGAC TTGTTTCTGGC GGT C AGT C T T GCC AT C C T C AC AGAGGT T GGC GGCC C GAGAGAG 3024 

Qy 315 T GT GAGGCAGAGGC GGGGAGTGGCAAGGGAGT GAC CATC TC GGGGAAC GAAGGAGT AAAC 374 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3025 T GT GAGGCAGAGGCGGGGAGTGGCAAGGGAGT GAC CATC TC GGGGAAC GAAGGAGT AAAC 3084 

Qy 375 GCGGTGATGGGACGCACGGAAACGGGAGTGGAGAAAGTCATGGAGAGAACCCTAGGCGGG 434 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 3085 GCGGT GATGGGAC GC AC GGAAAC GGGAG TGGAGAAAGTC ATGGAGAGAAC CC T AGGCGGG 3144 

Qy 435 GCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGG 4 94 

I I I I I I I I I I I I M I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3145 GCGGTCCCCGCGGAAAGGCGGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGG 3204 

Qy 495 CCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGC 554 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3205 CCCGGCCCCGCCCCGCAGGCCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGC 3264- 

Qy 555 CGC C T GCGCGGAGC C C C AC TGC GAAGC C C AGC T GC GC 591 



3265 CGCCTGCGCGGAGCCCCACTGCGAAGCCCAGCTGCGC 3301 



RESULT 13 

AX346794 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



ORIGIN 



AX346794 5283 bp DNA linear PAT 01-FEB-2002 

Sequence 1865 from Patent WO0200928. 

AX346794 

AX34 6794.1 GI: 184 94680 

synthetic construct 
synthetic construct 

other sequences; artificial sequences. 
1 

01ek,A., Piepenbrock, C . and Berlin, K. 

Diagnosis of diseases associated with the immune system 
Patent: WO 0200928-A 1865 03-JAN-2002; 
Epigenomics AG (DE) 

Location/Qualifiers 

1. .5283 

/organism=" synthetic construct" 
/mol_type="unas signed DNA" 
/db_xref="taxon: 32630" 

/note="chemically treated genomic DNA (Homo sapiens)" 



Query Match 57.2%; 
Best Local Similarity 77.4%; 
Matches 4 67; Conservative 



Score 364.2; DB 2; 
Pred. No. 6.6e-92; 
0; Mismatches 133; 



Length 5283; 



Indels 



3; Gaps 



2; 



Qy 

Db 



35 CGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCGC 94 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 

2 CGATAAC GC GGC GGT TGTAAT TAAT CGATAGC GAGG TC GG TC GC GAGG T T T T AGT T TC GT 61 
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ov/Wivi^ ocoicii xvcauita ucuuia iui rvppuvauun \J7okj?\j?o aiiu ocaiiai ivcsun ua-v?-ou7... A-a^c / 01 



Qy 95 CCTGCAGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGGG 154 

II Mill II I I I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 62 TTTGTAGGAGTTAGTCGCGCGTTCGTTCGTAGGAGGGTGGGTAGTTTGTTTAGCGTA-GG 120 

Qy 155 GGGC TGGGCC C ATAAAAGAGGAAG T GC AC T T AAGAC ACGGCCC C GC T GGAC GC T TG T TAG 214 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 GGGT TGGGTT T ATAAAAGAGGAAGTGTATTTAAGATACGGTT TAG — TGGACGTTGTTAG 178 

Qy 215 AAAC C GT C C T GGC TGGGAAGGCAAGAGG TGTG T GAC TGGAC AAGAC T T GT T T C TGGC GGT 274 

III III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 179 AAATCGTTTTGGTTGGGAAGGTAAGAGGTGTGTGATTGGATAAGATTTGTTTTTGGCGGT 238 

Qy 27 5 C AG T C T T GCC ATC C T C AC AGAGGT TGGC GGC CC GAGAGAG TGTGAGGC AGAGGC GGGGAG 334 

III III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 239 TAGTT TTGT TAT TTT TAT AGAGGT TGGC GGTTCGAGAGAGTGTGAGGT AGAGGC GGGGAG 2 98 

Qy 335 TGGCAAGGGAGTGAC CATC TC GGGGAAC GAAGGAGT AAAC GC GGTGAT GGGAC GC ACGGA 394 

III I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 299 TGG TAAGGGAGTGAT TAT T TCGGGGAAC GAAGGAGT AAAC GC GGTGAT GGGACGT ACGGA 358 

Qy 395 AAC GGGAGTGGAGAAAGTC ATGGAGAGAAC C C T AGGC GGGGC GGTC CC CGCGGAAAGGC G 4 54 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 359 AAC GGGAGTGGAGAAAGT T ATGGAGAGAAT T T T AGGC GGGGC GGT T T T CGCGGAAAGGC G 418 

Qy 455 GCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGC TGGC AGGCCCGGCCCCGCCCCGC AGGC 514 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II I I I I I 
Db 419 GTTGTTTTAGGGTTTTCGTATTTAAGTAGGAGTTGGTAGGTTCGGTTTCGTTTCGTAGGT 478 

Qy 515 CCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCACT 574 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 47 9 TTTATTTCGGGTTTCGTTTTCGAGGTTTAAGTCGCGTCGTCGTTTGCGCGGAGTTTTATT 538 

Qy 575 GCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGAC 634 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 539 GCGAAGTTTAGTTGCGCGCGTTTTGGGATTGATTGTTTACGTTCGTTCGGTTCGTTCGAC 598 

Qy 635 GCG 637 

I I I 

Db 599 GCG 601 



RESULT 14 

AX346795/C 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



linear PAT 01-FEB-2002 



AX346795 5283 bp DNA 

Sequence 1866 from Patent WO0200928. 
AX346795 

AX346795.1 GI: 18494681 



synthetic construct 
synthetic construct 

other sequences; artificial sequences. 
1 

01ek,A., Piepenbrock, C. and Berlin, K. 

Diagnosis of diseases associated with the immune system 
Patent: WO 0200928-A 1866 03-JAN-2002; 
Epigenomics AG (DE) 

Location/Qualifiers 

1. .5283 

/organism="synthetic construct" 
/mol_type="unassigned DNA" 
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/db_xref="taxon:32630" 

/note="chemically treated genomic DNA (Homo sapiens)" 

ORIGIN 

Query Match 4 9.0%; Score 312.4; DB 2; Length 5283; 

Best Local Similarity 72.0%; Pred. No. 3.5e-77; 

Matches 435; Conservative 0; Mismatches 166; Indels 3; Gaps 2; 

Qy 34 CCGACAACGCGGCGGCTGTAACCAATCGACAGCGAGGCCGGTCGCGAGGCCCCAGTCCCG 93 

I I II I I I I I I I II II I I I I I I I I I I I I I Ml III I I I I I I I I I I I I I I I I 
Db 5283 C C GACAAC GCGAC GAC TAT AACCAATCGAC AAC GAAAC CGAT CGC GAAACC C CAAT C CC G 5224 

Qy 94 CCCTGCAGGAGCCAGCCGCGCGCTCGCTCGCAGGAGGGTGGGTAGTTTGCCCAGCGTAGG 153 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 5223 CCCTACAAAAACCAACCGCGCGCTCGCTCGCAAAAAAATAAATAATTTACCCAACGTA-A 5165 

Qy 154 GGGGC T GGGCC C ATAAAAGAGGAAGTGC AC T T AAGAC AC GGC CC C GC T GGAC GC T T GT T A 213 

II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 

Db 5164 AAAAC TAAACC CATAAAAAAAAAAATAC ACTTAAAAC ACGACCC — AATAAACGC TATT A 5107 

Qy 214 GAAAC C GT CC T GGC T GGGAAGGC AAGAGGTGTGTGAC TGGAC AAGAC TTGTTTCTGGC GG 273 

I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I II 
Db 5106 AAAAC CGT C C T AAC TAAAAAAACAAAAAATAT ATAAC T AAACAAAAC T TAT T TC TAACGA 5047 

Qy 274 T C AGTC T TGC C AT CC TC AC AGAGG T TGGC GGCC C GAGAGAGTGT GAGGC AGAGGC GGGGA 333 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I III II I 
Db 504 6 TCAATC T T ACC AT CC TC AC AAAAAT TAACGACC C GAAAAAATAT AAAAC AAAAAC GAAAA 4987 

Qy 334 GTGGCAAGGGAGTGACC AT C TCGGGGAACGAAGGAGTAAAC GCGGTGATGGGACGC AC GG 393 

I III I I I 1 I I I I I i I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 4986 ATAAC AAAAAAAT AAC CAT C TCGAAAAACGAAAAAATAAAC GCGATAATAAAACGC AC GA 4927 

Qy 394 AAAC GGGAGTGGAGAAAGT CATGGAGAGAACC C T AGGC GGGGCGGTCC C C GC GGAAAGGC 453 

I II I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 492 6 AAAC GAAAATAAAAAAAAT C ATAAAAAAAACC C TAAAC GAAAC GAT C C CCGC GAAAAAAC 48 67 

Qy 454 GGCTGCTCCAGGGTCTCCGCACCCAAGTAGGAGCTGGCAGGCCCGGCCCCGCCCCGCAGG 513 

I I I I I I I I I I I I I I I I II II I I I III II I I I I I I I I I I I I I I I I 
Db 4866 GAC T AC T C CAAAAT C TC C GCACCCAAAT AAAAAC TAAC AAAC CC GAC C C CGC CCCGCAAA 4807 

Qy 514 CCCCACCCCGGGCCCCGCCCCCGAGGCTTAAGCCGCGCCGCCGCCTGCGCGGAGCCCCAC 573 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I 
Db 4806 C CC CACCCCGAACCCCGCCCCCGAAACT TAAACC GCGCCGCCGCCTACGCGAAACCCC AC 4747 

Qy 574 TGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTCCGA 633 

I I I II I I I I II I I I I I I I I I I III III I I I I I I I I II I I I I I I I I I II I I 
Db 4746 TAC GAAAC CCAAC T AC GC GCGCC T TAAAAT TAAC TATCCAC GC T C GCC CGAC TCGT C CGA 4687 

Qy 634 CGCG 637 

I I I I 

Db 4686 CGCG 4683 



RESULT 15 

CQ776740 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 



CQ776740 1646 bp DNA 

Sequence 426 from Patent EP1394274. 
CQ776740 

CQ776740.1 GI: 45380130 
Homo sapiens (human) 



linear PAT ll-MAR-2004 
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ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



ORIGIN 



Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 
1 

Ohtani,N., Sugita,Y., Yamaya,M., Kubo,H., Nagai,H. and Izuhara,K. 
Methods of testing for bronchial asthma or chronic obstructive 
pulmonary disease 

Patent: EP 1394274-A 426 03-MAR-2004; 
Genox Research, Inc. (JP) 

Location/Qualifiers 

1. .1646 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db xref= M taxon:9606" 



Query Match 10.5%; Score 67; DB 2; Length 164 6; 

Best Local Similarity 100.0%; Pred. No. 1.8e-07; 
Matches 67; Conservative 0; Mismatches 0; Indels 



0 ; Gaps 



0; 



QY 



Db 



571 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 630 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I 
1 CACTGCGAAGCCCAGCTGCGCGCGCCTTGGGATTGACTGTCCACGCTCGCCCGGCTCGTC 60 



Qy 



Db 



631 CGACGCG 637 
I I I I I I I 
61 CGACGCG 67 



Search completed: June 6, 2006, 21:24:48 
Job time : 4061 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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